AMERICAN 
JOURNAL OF MATHEMATICS 


FOUNDED BY THE JOHNS HOPKINS UNIVERSITY 


EDITED BY 
S. EILENBERG 
COLUMBIA UNIVERSITY 


A. WINTNER 
THE JOHNS HOPKINS UNIVERSITY 


R. BAER 
UNIVERSITY OF ILLINOIS 


D. C. LEWIS, JR. 
THE JOHNS HOPKINS UNIVERSITY 
WITH THE COOPERATION OF 


L. V. AHLFORS P. HARTMAN 

S. S. CHERN G. P. HOCHSCHILD 
W. L. CHOW I. KAPLANSKY 

P. R. HALMOS W. S. MASSEY 


PUBLISHED UNDER THE JOINT AUSPICES OF 


THE JOHNS HOPKINS UNIVERSITY 
AND 


THE AMERICAN MATHEMATICAL SOCIETY 


Volume LXXIV, Number 3 
JULY, 1952 




THE JOHNS HOPKINS PRESS 
BALTIMORE 18, MARYLAND
U. S. A. 


H. SAMELSON 
R. M. THRALL 
A. D. WALLACE 
A. WEIL 


CONTENTS

PAGE
On the cohomology theory for associative algebras. By I. H. Rose, 531
On related periodic maps. By E. E. Floyd, 547
Topology of metric complexes. By C. H. Dowker, 555
On the unboundedness of the essential spectrum. By C. R. Putnam, 578
Properties of conformal invariants. By Vidar Wolontis, 587
On geodesic torsions and parabolic and asymptotic curves. By Philip Hartman and Aurel Wintner, 607
On the theory of geodesic fields. By Philip Hartman and Aurel Wintner,
Note on double-modules over arbitrary rings. By Tadasi Nakayama, 645
Order and topology in projective planes. By Oswald Wyler, 656
Means in groups. By W. R. Scott, 667
A proof of the maximal chain theorem. By Orrin Frink, 676
Notes on left division systems with left unit. By M. F. Smiley, 679
A characterization of finite dimensional convex sets. By E. G. Straus and F. A. Valentine, 683
On additive ideal theory in general rings. By Charles W. Curtis, 687
Two decomposition theorems for a class of finite oriented graphs. By
On the non-vanishing of certain Dirichlet series. By Aurel Wintner, 723
On the fundamental group of an algebraic variety. By Wei-Liang Chow, 726
Induced representations. By F. I. Mautner,


The AMERICAN JOURNAL OF MATHEMATICS will appear four times yearly. 

The subscription price of the JOURNAL for the current volume is $7.50 (foreign
postage 50 cents); single numbers $2.00.

A few complete sets of the JOURNAL remain on sale.

Papers intended for publication in the JOURNAL may be sent to any of the Editors.

Editorial communications should be sent to Professor AUREL WINTNER at The Johns
Hopkins University.

Subscriptions to the JOURNAL and all business communications should be sent to
THE JOHNS HOPKINS PRESS, BALTIMORE 18, MARYLAND, U. S. A.


Entered as second-class matter at the Baltimore, Maryland, Postoffice, acceptance for mailing at special 
rate of postage provided for in Section 1103, Act of October 8, 1917, Authorized on July 8, 1918. 


PRINTED IN THE UNITED STATES OF AMERICA 
BY J. H. FURST COMPANY, BALTIMORE, MARYLAND 


ON THE COHOMOLOGY THEORY FOR ASSOCIATIVE 
ALGEBRAS.* 


By I. H. ROSE.


1. Introduction. Cohomology theory in general (cf. [1]) concerns 
itself with the following situation, which arose originally in topology: 


$C^0, C^1, \cdots, C^n, \cdots$ is a sequence of abelian groups, $\delta^0, \cdots, \delta^n, \cdots$
a sequence of homomorphisms such that $\delta^n$ maps $C^n$ into $C^{n+1}$ and such that
$\delta^{n+1}\delta^n = 0$, $(n \geq 0)$.


In this situation the elements of $C^n$ are called n-dimensional cochains;
the kernel of $\delta^n$ is denoted $Z^n$ and its elements are called n-dimensional
cocycles; the image of $\delta^n$ is denoted $B^{n+1}$ and its elements are called
(n + 1)-dimensional coboundaries; $B^0$ is defined to be the zero element of $C^0$.

Since $\delta^{n+1}\delta^n = 0$, it follows that $B^n \subset Z^n$. We may therefore define the
"cohomology" group $H^n = Z^n/B^n$; two n-dimensional cocycles are called
cohomologous if their difference is a coboundary. The symbol $\delta$ is used to
represent any of the homomorphisms $\delta^n$, since this leads to no ambiguity.

The cohomology theory for associative algebras specializes the general 
situation as follows. 

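Let F be a field over which are defined an associative algebra A and a
vector space P. Suppose further that P is a two-sided A-module, i.e. for each
$a \in A$, $p \in P$ there are defined elements $a \cdot p$, $p \cdot a \in P$ which are bilinear
functions of a and p such that

$a_1 \cdot (a_2 \cdot p) = (a_1 a_2) \cdot p$, $(p \cdot a_1) \cdot a_2 = p \cdot (a_1 a_2)$, $(a_1 \cdot p) \cdot a_2 = a_1 \cdot (p \cdot a_2)$.

Now for n > 0 define $C^n = C^n(A, P)$ as the vector space of all n-linear
functions of n variables in A with values in P, and define $C^0 = C^0(A, P) = P$.
Furthermore, for n > 0 and $f \in C^n(A, P)$ define $\delta f$ such that:

$\delta f(a_1, \cdots, a_{n+1}) = a_1 \cdot f(a_2, \cdots, a_{n+1}) + \sum_{i=1}^{n-1} (-1)^i f(a_1, \cdots, a_i a_{i+1}, \cdots, a_{n+1})$
$\qquad + (-1)^n f(a_1, \cdots, a_n a_{n+1}) - (-1)^n f(a_1, \cdots, a_n) \cdot a_{n+1}.$

Finally, for n = 0, define $\delta p(a) = a \cdot p - p \cdot a$.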
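
The coboundary formula just given can be checked numerically on a small example. The sketch below is illustrative only: it assumes, purely for the sake of the example, that A is the algebra of 2 x 2 matrices with the matrix units as basis and that P is A itself with multiplication as the two-sided module action. The helper names (`basis_product`, `coboundary`, and so on) are hypothetical and do not come from the paper. A cochain is stored by its values on tuples of basis elements, which determines it by multilinearity.

```python
import itertools
import random

DIM = 4  # basis E11, E12, E21, E22 of the 2x2 matrix algebra, indexed 0..3


def basis_product(p, q):
    """Index of the matrix unit E_p * E_q, or None when the product is zero."""
    i, j = divmod(p, 2)
    k, l = divmod(q, 2)
    return 2 * i + l if j == k else None


def left_act(p, v):
    """Coordinates of E_p . v, where v is a coordinate vector in P = A."""
    out = [0] * DIM
    for q, c in enumerate(v):
        r = basis_product(p, q)
        if r is not None:
            out[r] += c
    return out


def right_act(v, p):
    """Coordinates of v . E_p."""
    out = [0] * DIM
    for q, c in enumerate(v):
        r = basis_product(q, p)
        if r is not None:
            out[r] += c
    return out


def coboundary(f, n):
    """Coboundary of an n-cochain f stored as {tuple of n basis indices: vector}."""
    df = {}
    for args in itertools.product(range(DIM), repeat=n + 1):
        # a_1 . f(a_2, ..., a_{n+1})
        total = left_act(args[0], f[args[1:]])
        # (-1)^i f(a_1, ..., a_i a_{i+1}, ..., a_{n+1}), i = 1, ..., n
        for i in range(n):
            r = basis_product(args[i], args[i + 1])
            if r is not None:
                term = f[args[:i] + (r,) + args[i + 2:]]
                sign = (-1) ** (i + 1)
                total = [t + sign * x for t, x in zip(total, term)]
        # -(-1)^n f(a_1, ..., a_n) . a_{n+1}
        last = right_act(f[args[:n]], args[n])
        sign = (-1) ** (n + 1)
        df[args] = [t + sign * x for t, x in zip(total, last)]
    return df


def random_cochain(n, rng):
    """An n-cochain with small random integer values on basis tuples."""
    return {args: [rng.randint(-3, 3) for _ in range(DIM)]
            for args in itertools.product(range(DIM), repeat=n)}


def is_zero(cochain):
    return all(all(c == 0 for c in v) for v in cochain.values())


rng = random.Random(0)
for n in range(3):
    f = random_cochain(n, rng)
    # delta(delta f) should vanish identically
    print(n, is_zero(coboundary(coboundary(f, n), n + 1)))
```

For n = 0 the same function reduces to $\delta p(a) = a \cdot p - p \cdot a$, so no separate case is needed in the code.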


* Received May 16, 1951. 
† This paper is based on a portion of the author's doctoral dissertation at Harvard
University. 




It is then easily proved (see [4], p. 60) that $\delta$ is a vector space
homomorphism such that $\delta\delta = 0$, so that we have here a special case of the preceding
general situation. In this case we denote $Z^n$ by $Z^n(A, P)$, $B^n$ by $B^n(A, P)$ etc.

This paper is concerned with several problems suggested by Hochschild.
We consider first a problem proposed in [4]. After introducing for a given
algebra the condition $C_m$: "All m-dimensional cohomology groups vanish,"
the statement is made (p. 58) "Theorem 3.1 implies that $C_{m+1}$ is a consequence
of $C_m$, for $m \geq 1$. But it is an open question whether or not $C_m$ and
$C_{m+1}$ are equivalent."

We denote by $K_m$, m = 0, 1, 2, ..., the class of all algebras over F
whose m-dimensional cohomology groups are all zero, or in other words whose
m-dimensional cocycles are all coboundaries. (It is easily proved that $K_0$ is the null class.)
One may then paraphrase the question concerning the condition $C_m$ as follows:
For $m \geq 0$ we have $K_m \subset K_{m+1}$; do we actually have $K_m = K_{m+1}$?

For m = 0, 1, 2 Hochschild has proved that the answer to this question
is in the negative. The proof for m = 0 follows from his demonstration that
the algebras in $K_1$ are a well-known non-null set, namely the set of algebras
separable over F ([4], Theorem 4.1). For m = 1 an algebra (which we
denote H') is produced such that $H' \in K_2 - K_1$ ([4], section 9). The case
m = 2 is disposed of by first proving that adjoining an identity to an algebra
does not affect its K-class ([5], section 2); then, letting H be the algebra
formed by adjoining an identity to H', it is proved that the Kronecker product
$H \times H \in K_3 - K_2$ ([6], Theorem 9.2).

Immediately following Hochschild’s proof of this last result he states a 
conjecture which we shall call Conjecture 1. 


CONJECTURE 1. The n-fold Kronecker product $H \times H \times \cdots \times H$ does
not belong to $K_n$.


We prove Conjecture 1 in Prop. 6.1. There easily follows (Theorem 6.1)
the answer to the first question raised, namely: $K_m \neq K_{m+1}$; for although the
n-fold product $H \times H \times \cdots \times H$ does not belong to $K_n$, it does, as a
consequence of Theorem 5.1, belong to $K_{n+1}$.

Conjecture 1 is followed in [6] by another conjecture: 


CONJECTURE 2. If $A \in K_p - K_{p-1}$ and $B \in K_q - K_{q-1}$, then the Kronecker
product $A \times B \in K_{p+q-1} - K_{p+q-2}$.


We shall consider in this paper the following two conjectures which 
together imply Conjecture 2. 




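CONJECTURE 2a. $A \in K_p$, $B \in K_q \Rightarrow A \times B \in K_{p+q-1}$.
CONJECTURE 2b. $A \notin K_p$, $B \notin K_q \Rightarrow A \times B \notin K_{p+q}$.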


A special case of Conjecture 2a, namely that in which A and B have 
identities and q = 2, is a consequence of the following theorem in [6]: 


THEOREM 9.1. Let A and B be algebras with identity elements, and
suppose that all two-dimensional cohomology groups of B are zero. Then
for every $A \times B$ module Q and $n \geq 1$, we have

$H^n[A, Z^0(B, C^1(A \times B, Q))] = H^{n+1}(A \times B, Q)$.

In [6] the proof of Theorem 9.1 is followed by another conjecture:

CONJECTURE 3. If A and B have identities, and $B \in K_{p+1}$, then

$H^n[A, Z^0(B, C^p(A \times B, Q))] = H^{n+p}(A \times B, Q)$.


We prove Conjecture 3 in Theorem 4.1. There immediately follows
(Theorem 5.1) that Conjecture 2a is true for algebras with identities.
(Conjectures 2b and 2 are definitely not true for algebras A, B without
identities; a counterexample will be exhibited in a subsequent paper on the
classification of algebras by means of cohomology theory.) For the case
p = 0, A with an identity, we prove Conjecture 2b in Cor. 5.1, while in the
case p = 1, A and B with identities, a proof is given in Theorem 5.2. It is
likely that Conjecture 2b is true for algebras A, B with identities, but a proof
covering all cases remains to be found.


2. Notations and conventions. Except where otherwise indicated, the 
following notation and conventions will be assumed. 

F: A field over which all algebras in the sequel are defined; all algebras in 
the sequel have identities. 

$A \times B$: The Kronecker product of algebras A, B. If $1_A$, $1_B$ are the identities
of A, B respectively, we identify $a \times 1_B$ with $a \in A$ and $1_A \times b$ with
$b \in B$ in situations where these identifications lead to no ambiguity;
also, we denote $1_A \times 1_B$ by 1.

$S_i$: The element $a_i \times b_i \in A \times B$.

${}_r l_s$: The sequence $l_r, l_{r+1}, \cdots, l_s$.

$L_p$: The class of algebras $K_{p+1} - K_p$, p = 0, 1, 2, ...; note that $L_0 = K_1$.

3. The D-modules $M_n$ and $M'_n$ (n = 0, 1, 2, ...). Let D be an
algebra, M a D-module. We shall find it useful to make the vector space
$C^n(D, M)$ into a D-module in two ways.




(I) For n > 0, $C^n(D, M)$ is made into a D-module $M_n$ as follows:
Let $d, {}_1d_n \in D$, $g \in C^n(D, M)$. Then we define $d \cdot g$ and $g \cdot d$ such that
$\{d \cdot g\}({}_1d_n) = d \cdot g({}_1d_n)$ and $\{g \cdot d\}({}_1d_n) = d \cdot g({}_1d_n) - \delta g(d, {}_1d_n)$.
(II) For n > 0, $C^n(D, M)$ is made into a D-module $M'_n$ as follows:
Let $d, {}_1d_n \in D$, $g \in C^n(D, M)$. Then we define $d \cdot g$ and $g \cdot d$ such that
$\{d \cdot g\}({}_1d_n) = (-1)^n \delta g({}_1d_n, d) + g({}_1d_n) \cdot d$ and $\{g \cdot d\}({}_1d_n) = g({}_1d_n) \cdot d$.
The definitions are completed by setting $M_0 = M'_0 = M$; to verify that
$M_n$, $M'_n$ actually are D-modules involves only straightforward computation.
The usefulness of these modules lies in the following two propositions.


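PROPOSITION 3.1. Given $f \in C^{m+n}(D, M)$, $m \geq 0$, n > 0, define
$\bar f \in C^m(D, M_n)$ such that $\bar f({}_1d_n) = f({}_1d_n)$ for m = 0,
$\{\bar f({}_1d_m)\}({}_{m+1}d_{m+n}) = f({}_1d_{m+n})$ for m > 0. Then
$\{\delta \bar f({}_1d_{m+1})\}({}_{m+2}d_{m+n+1}) = \delta f({}_1d_{m+n+1})$.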

Proof. Note first that for m > 0, 


{f(1dm) dines ‘ {f (.dm) } ) 


Therefore, for m > 0, 

For m = 0 the result follows immediately from (I). 
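
As a sanity check of Proposition 3.1 in the smallest non-trivial case m = n = 1, the following sketch compares both sides on basis elements. It is not the paper's argument, only a numerical illustration under stated assumptions: D is taken to be the 2 x 2 matrix algebra with the matrix units as basis, M is D itself, and the module $M_1$ is built exactly as in definition (I). All function names are hypothetical.

```python
import itertools
import random

DIM = 4  # matrix units E11, E12, E21, E22 of the 2x2 matrix algebra D; M = D
ZERO = [0] * DIM


def basis_product(p, q):
    """Index of E_p * E_q, or None when the product is zero."""
    i, j = divmod(p, 2)
    k, l = divmod(q, 2)
    return 2 * i + l if j == k else None


def left_act(p, v):
    out = [0] * DIM
    for q, c in enumerate(v):
        r = basis_product(p, q)
        if r is not None:
            out[r] += c
    return out


def right_act(v, p):
    out = [0] * DIM
    for q, c in enumerate(v):
        r = basis_product(q, p)
        if r is not None:
            out[r] += c
    return out


def add(u, v):
    return [x + y for x, y in zip(u, v)]


def sub(u, v):
    return [x - y for x, y in zip(u, v)]


def at_product(g, p, q):
    """Value of the 1-cochain g (a list of vectors) on the product E_p E_q."""
    r = basis_product(p, q)
    return list(ZERO) if r is None else g[r]


def delta1(g, d, x):
    """Coboundary of a 1-cochain g with values in M, evaluated at (E_d, E_x)."""
    return add(sub(left_act(d, g[x]), at_product(g, d, x)), right_act(g[d], x))


def left_mod(d, g):
    """d . g in M_1, following (I): (d.g)(x) = d . g(x)."""
    return [left_act(d, g[x]) for x in range(DIM)]


def right_mod(g, d):
    """g . d in M_1, following (I): (g.d)(x) = d . g(x) - (delta g)(d, x)."""
    return [sub(left_act(d, g[x]), delta1(g, d, x)) for x in range(DIM)]


def delta_f(f, d1, d2, d3):
    """Coboundary of the 2-cochain f (f[d1][d2] is a vector in M)."""
    total = left_act(d1, f[d2][d3])
    r12 = basis_product(d1, d2)
    if r12 is not None:
        total = sub(total, f[r12][d3])
    r23 = basis_product(d2, d3)
    if r23 is not None:
        total = add(total, f[d1][r23])
    return sub(total, right_act(f[d1][d2], d3))


def delta_f_bar(f, d1, d2):
    """Coboundary of f-bar in C^1(D, M_1), where f-bar(d)(x) = f(d, x)."""
    r = basis_product(d1, d2)
    middle = [list(ZERO) for _ in range(DIM)] if r is None else f[r]
    first = left_mod(d1, f[d2])
    last = right_mod(f[d1], d2)
    return [add(sub(first[x], middle[x]), last[x]) for x in range(DIM)]


def random_2cochain(rng):
    return [[[rng.randint(-3, 3) for _ in range(DIM)] for _ in range(DIM)]
            for _ in range(DIM)]


def proposition_3_1_holds(f):
    return all(delta_f_bar(f, a, b)[c] == delta_f(f, a, b, c)
               for a, b, c in itertools.product(range(DIM), repeat=3))


print(proposition_3_1_holds(random_2cochain(random.Random(0))))
```

Checking on basis elements is enough here because both sides are trilinear in their arguments.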

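PROPOSITION 3.2. Given $f \in C^{m+n}(D, M)$, $m \geq 0$, n > 0, define
$\bar f \in C^m(D, M'_n)$ such that $\bar f({}_1d_n) = (-1)^n f({}_1d_n)$ for m = 0,
$\{\bar f({}_{n+1}d_{n+m})\}({}_1d_n) = (-1)^n f({}_1d_{m+n})$ for m > 0.

Then $\{\delta \bar f({}_{n+1}d_{n+m+1})\}({}_1d_n) = \delta f({}_1d_{m+n+1})$.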

Proof. Note first that for m > 0, 


{dna (adn) 
= + }(2dn) * Ones 
(—1)"[ ds - —* 
+ (— Andns1) | 
== d;° f(2dmsns1) (— 1)"f(1dn-s, 




Therefore, for m > 0, 


}(:dn) 
For m = 0 the result follows immediately from (II). 


COROLLARY 3.1. $H^{m+n}(D, M) = H^m(D, M_n)$ for m > 0, $n \geq 0$.
COROLLARY 3.2. $H^{m+n}(D, M) = H^m(D, M'_n)$ for m > 0, $n \geq 0$.


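4. The cohomology group of a Kronecker product.
LEMMA 4.1. Suppose p > 0, $f \in C^{p+1}(A \times B, M)$ and that
(i) f is a coboundary on B, i.e. there exists $g \in C^p(B, M)$ such that
$\{f - \delta g\}({}_1b_{p+1}) = 0$.
(ii) i > p, $S_i \in B \Rightarrow \delta f({}_1S_{p+2}) = 0$.
Then there exists $h \in C^p(A \times B, M)$ such that
i > p, $S_i \in B \Rightarrow \{f - \delta h\}({}_1S_{p+1}) = 0$.
Proof. Adopting the notation
$[\delta f({}_1S_{p+2})]_k$: the sum of the first k terms of the expansion of $\delta f({}_1S_{p+2})$,
${}_l[\delta f({}_1S_{p+2})]$: the sum of the l-th term and its successors in the expansion of
$\delta f({}_1S_{p+2})$,
$a_{np}$: the product $a_n a_{n+1} \cdots a_p$, $(n \leq p)$,
and defining the cochains
$u({}_1S_p) = a_{1p} \cdot g({}_1b_p) - f(a_{1p}, {}_1b_p)$,
$v({}_1S_p) = (-1)^p f({}_1S_p, 1)$,
$f_r({}_1S_p) = f({}_1S_{r-1}, a_{rp}, {}_rb_p)$, $(1 < r \leq p)$,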


we derive the following relations: 


(1). 8u(S,, b2) = 8, u(be) — u(Sib2) + u(S1) be 
= 8,-9(b2) — 81° f(1, b2) — g(bibe) + f(ai, b1b2) 
+ a+ 9(b1) bs — b1) be 
= be) — f(1, b2)] + be 
= 8f(a1, be) + f(S1, b2) — [8f(S1, 1, 62) — f(S1, 1) be] 
= f(S:, b2) — f(S1, 1) be. (p= 1) 




(2) bpsr) = W(2Sp, — U(SiS2, + 

+ (— 1)?u( Spr, Spbps1) — (— 1)?u(aSp) 

= — Aip* 9(b1b2, + °° 

+ f(aip, 612, sbps1) —* *— (—1)?f (Arp, 1Dp-1, bpbps1) 

+ (—1)?f(aip, bps 

= dip * 89(1bps1) — S1* f(dep, 2b ps1) + Sf (Grp, 1b 

= 2bps1) — f(Gep, 2bp41). (p > 1) 


(3). 8v(iSp, = Si v(28p, — Sp, 
= (— 1)?[S1- f (Sp, 1) — (S182, pSs, bp, 1) + 
+ (—1)?f (Spa, 1) — (—1)?f(18p, 1) 
= (— 1) + (— 1)?fGSp, — (— 1 
—(— 1)?f(.Sp, 1): 
= f (Sp, — 1 — 1) Opa. 
(4). 8fp(:Sp, Op) = fp(2Sp, Oper) — fp(SiS82, +° 
is 1)?fp(1Sp-2, Sp+Sp, bps1) + Spbp+1) 
= f(2Sp, 1, bp.) — f(SS2, 1, bp) +° 
1)?f(Sp-2, Sp+Sp, 1, bp-1) 
+ (—1)?f — (—1)?f (Spa, ap, bp) 
8f (1Sp, 1, bps1) — 1)?f (:Sp, 1) 
+ (—1)?f dp, — (—1)?f ap, bp) bps 
= (— 1)?[f(Sp-1, Op, — f(1Sp, 1) — Gp, Bp) bps]. 


(5). (Sp, bps) = fr(S2, bps1) — fr(S182, bp) +° 
— (—1)"fr(Sr-2, SraSr, 
+ (—1)"fr (Sra, (— 1)? fr Opn 
— (—1)'f (81-2, SraSr, 
— (—1)f (aS Arp, reed ps1) 
(—1)?f (Sra, rp, rbp) Opes 




(6). Case 1. p=2. >» Dpr1) = bs) 
= f(S;, do, bebs) — f (182, 1) b3 — f(S1, ae, 
= — 2b3) + 81° f (ae, 263) — f (Side, 
f(:S2, bs) — 1) ‘bs 
= f (dep, — f(Sidep; + ps1) — 1) 
(6). Case 2. p>2 ba) 


= {8f2—9f2 +> + (—1)?8fp} (Sp, 

[8f(1S2, dsp, J2 — Gap, 2bps1) J 
— [8f (153, ap, + [8F abpsr) 
+ [8f Ja — (1s, Gap; J 


= — 4[8f(S1, dep, J 
— f (G1, S2dsp, + f (S82, 
— f (182, Ssdap, + f(183, Sasp, 


— f(iSps, + f (Spe, Sprdp, pbps1) 

— (—1)?[f (Sp, 4p; 

+ f dp, —f (Sp, 1) — f (Spa, ap, Bp) 
= f (dep, 2dps1) — f(Sidep, + f (Si, abpsi) 

f(S1, Sodzp, 30p41) f (Spe, Sp14p, 

— f (Sp-2, pbpsr) + — p15 Ap, Opbps1) 

+ Gp, Dp) 

+f (Spas bpbpsr) — f(aSp,1) — f (aS bp) 

= f (Gop, 2bpi1) — f (Sidep, + (1S p, — f 1) 




It may now be verified by straightforward computation that the following 
definition of h will satisfy the requirements of the lemma: 


For p = 1, $h(S_1) = \{2u - v\}(S_1) - u(S_1) \cdot 1$.
For p> 1, h(.Sp) = {(2u—v+ 22 p) 
— {u+ (— (Sp) 


This completes the proof of the lemma. 


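LEMMA 4.2. Suppose $p \geq 0$, $B \in K_{p+1}$, $f \in C^{p+1}(A \times B, M)$, and that
i > p, $S_i \in B \Rightarrow \delta f({}_1S_{p+2}) = 0$. Then there exists $h \in C^p(A \times B, M)$ such
that i > p, $S_i \in B \Rightarrow \{f - \delta h\}({}_1S_{p+1}) = 0$.


Proof. The result is trivial for p = 0, and follows immediately from
Lemma 4.1 for p > 0.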


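LEMMA 4.3. Suppose $p \geq 0$, $r \geq 1$, $B \in K_{p+1}$, $f \in C^{r+p+1}(A \times B, M)$ and
that
(i) i > p, $S_i \in B \Rightarrow \delta f({}_1S_{r+p+2}) = 0$,


(ii) i > p + 1, $S_i \in B \Rightarrow f({}_1S_{r+p+1}) = 0$.
Then there exists $h \in C^{r+p}(A \times B, M)$ such that
i > p, $S_i \in B \Rightarrow \{f - \delta h\}({}_1S_{r+p+1}) = 0$.


Proof. We apply section 3, setting $D = A \times B$, n = r and (in Prop.
3.1) m = p + 1. We have, then, a cochain $\bar f \in C^{p+1}(A \times B, M_r)$ such that


(a) $\{\delta \bar f({}_1S_{p+2})\}({}_{p+3}S_{r+p+2}) = \delta f({}_1S_{r+p+2})$.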

Now let NV, be the submodule of M, consisting of all ¢ such that 
(b) i> Sie B= = 0, 
(c) t>pt+2, Spe B= {t- = 0. 


We assert that f e Ce(A X B, N,); for (b), in this case, follows imme- 
diately from (ii). To verify that f satisfies (a), we note first the following 
equality occurring in the proof of Prop. 3.1: 


+ f(S82, sSrapse) + (— ps2, 




Now if i > p + 2 and $S_i \in B$, we have the first term in the brackets zero
by (i) and the remaining terms in the brackets zero by (ii), q.e.d.

Next we show that if we replace f and M in Lemma 4.1 by $\bar f$ and $N_r$
respectively, then $\bar f$ satisfies the hypothesis of Lemma 4.1. First observe that
(i) and (a) imply that $\bar f$ is a cocycle on B. But then, since $B \in K_{p+1}$, it
follows that $\bar f$ is a coboundary on B. Next, note that the requirement i > p,
$S_i \in B \Rightarrow \delta \bar f({}_1S_{p+2}) = 0$ also follows from (i) and (a).

We may therefore conclude that for p > 0 there exists $\bar h \in C^p(A \times B, N_r)$
such that i > p, $S_i \in B \Rightarrow \{\bar f - \delta \bar h\}({}_1S_{p+1}) = 0$; the same conclusion is trivial
for p = 0.
Now we define he C™?(A X B,M) such that for p>0: J 

= {h(.8p)} rp); for p = 0: h(,S-+) = To show that h satisfies the 

conditions of the lemma, we assert first that 1 > p + 1, B => 8h (1S 


= 0; for we have 5h = {82 (18 p11) } (ps2S p11), and our assertion then 
follows from the definition of N,. 

Referring to (ii) we see that this proves the lemma for 1 > p +1, S;e B: 
There remains to prove only the caseet1—p-+1, S;e¢B. But 


Bh reper) = {8h Dyer) } 
{f Dps1) } (p+2S p+2S reper)» q. d. 
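LEMMA 4.4. Suppose $r \geq 0$, $p \geq 0$, $B \in K_{p+1}$, $f \in C^{r+p+1}(A \times B, M)$ and
that
i > p, $S_i \in B \Rightarrow \delta f({}_1S_{r+p+2}) = 0$.
Then there exists $h \in C^{r+p}(A \times B, M)$ such that
i > p, $S_i \in B \Rightarrow \{f - \delta h\}({}_1S_{r+p+1}) = 0$.


Proof. We use induction on r. For r = 0, the lemma coincides with
Lemma 4.2. Suppose, then, that the lemma is true for $r = k \geq 0$, and that
r = k + 1. Replacing D in Prop. 3.2 by $A \times B$, and setting n = 1,
m = r + p = k + p + 1, we have defined a cochain $\bar f \in C^{k+p+1}(A \times B, M'_1)$
such that $\{\delta \bar f({}_2S_{k+p+3})\}(S_1) = \delta f({}_1S_{k+p+3})$. But then $\bar f$ satisfies the hypothesis
of the lemma for r = k, so that there exists by the inductive hypothesis a
cochain $\bar h_1 \in C^{k+p}(A \times B, M'_1)$ such that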

i> p, B= —8hy} = 0. 


We now increase the dimension of h, by defining h,e€ Cho1(A X B,M) 
such that = for k + p > 0, and h,(8,) = h,(8;) 
for k+p=0. Then 8h; = — {8h1 (2S } (81). 




The cochain $h_1$ does not quite satisfy our requirements, since we now
have only: i > p + 1, $S_i \in B \Rightarrow \{f - \delta h_1\}({}_1S_{k+p+2}) = 0$. However, this result
means that $f - \delta h_1$ satisfies the hypotheses of Lemma 4.3 for r = k + 1, with
f replaced by $f - \delta h_1$. There exists therefore a cochain $h_2 \in C^{k+p+1}(A \times B, M)$
such that i > p, $S_i \in B \Rightarrow \{(f - \delta h_1) - \delta h_2\}({}_1S_{k+p+2}) = 0$.

Clearly, $h = h_1 + h_2$ satisfies the requirements of the lemma.


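LEMMA 4.5. Suppose r, $p \geq 0$, $B \in K_{p+1}$, $f \in C^{r+1}(A \times B, M'_p)$ and that
i > p, $S_i \in B \Rightarrow \delta f({}_{p+1}S_{p+r+2}) = 0$.
Then there exists $h \in C^r(A \times B, M'_p)$ such that
i > p, $S_i \in B \Rightarrow \{f - \delta h\}({}_{p+1}S_{p+r+1}) = 0$.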


Proof. If p=0 the lemma is a special case of Lemma 4.4. Suppose, 
then, that p > 0. We define f as in Prop. 3.2, replacing D by A X B and 
setting m=r-+1,n—p. Then we have {3f (pir Sparse) } = 8f (1S ps2). 
Therefore 1 > p, Sie B=> Sf = 0. 

But then f satisfies the hypothesis of Lemma 4.4. Therefore there exists 
he Cr?(A X B,M) such thatt> p, Sse B => {f —8h} = 0. 

The cochain h defined as follows will now satisfy the requirements of 
the lemma: for r>0, = for r—0, 
h (Sp) = (—1)?h(.Sp). 


THEOREM 4.1. $B \in K_{n+1} \Rightarrow H^{m+n}(A \times B, M) = H^m[A, Z^0(B, M'_n)]$,
(m > 0, $n \geq 0$).


Proof. In Lemma 4.5, change the notation by replacing f by f, h by g,
$M'_p$ by P and r + 1 by m. The result will be Lemma 3 of [5], (p. 574).
As an immediate consequence of this lemma Hochschild proves ([5], Theorem
6, p. 575) the following result: $H^m[A, Z^0(B, M'_p)] = H^m(A \times B, M'_p)$,
(m > 0, $p \geq 0$). Theorem 4.1 now follows from Cor. 3.2.


5. The K- and L-classes of a Kronecker product. 
THEOREM 5.1. $A \in K_m$, $B \in K_p \Rightarrow A \times B \in K_{m+p-1}$, (m, p > 0).


Proof. Let M be any $A \times B$ module, p = n + 1. Then the hypothesis
of Theorem 4.1 is satisfied, so that $H^{m+n}(A \times B, M) = H^m[A, Z^0(B, M'_n)]$.
But since $A \in K_m$ we have that $H^m[A, Z^0(B, M'_n)] = 0$. Therefore
$H^{m+n}(A \times B, M) = 0$, i.e. $A \times B \in K_{m+n} = K_{m+p-1}$, q.e.d.


With reference to determining a lower bound for the K-class of A X B, 




only the two simplest cases are settled here. The result for the simplest case 
is given in Cor. 5.1. Lemma 5.1 introduces a general construction which 
seems to yield a result (Theorem 5.2) only for the next case. 


PROPOSITION 5.1. Let M be any B-module (B with or without an
identity). We make P, the underlying vector space of $A \times M$, into an
$A \times B$ module by the following definition:


$(a_1 \times b) \cdot (a_2 \times m) = a_1 a_2 \times b \cdot m$, $(a_2 \times m) \cdot (a_1 \times b) = a_2 a_1 \times m \cdot b$,
where $a_1, a_2 \in A$, $b \in B$ and $m \in M$ are elements of a pre-chosen basis for
A, B, M respectively. This definition is then extended linearly to cover all of
$A \times B$ and P.

Now let $\phi$ be the homomorphism of $C^k(B, M)$ into $C^k(A \times B, P)$ defined
such that $\{\phi f\}({}_1S_k) = a_{1k} \times f({}_1b_k)$ for k > 0 and $\phi(m) = 1 \times m$ for k = 0.

Then $\phi$ induces an isomorphism of $H^k(B, M)$ into $H^k(A \times B, P)$, ($k \geq 0$).

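Proof. First we show that $\delta\phi = \phi\delta$. For k > 0,

$\{\delta(\phi f)\}({}_1S_{k+1}) = S_1 \cdot \phi f({}_2S_{k+1}) - \phi f(S_1S_2, {}_3S_{k+1}) + \cdots$
$\qquad + (-1)^k \phi f({}_1S_{k-1}, S_kS_{k+1}) - (-1)^k \phi f({}_1S_k) \cdot S_{k+1}$
$= a_{1,k+1} \times [b_1 \cdot f({}_2b_{k+1}) - f(b_1b_2, {}_3b_{k+1}) + \cdots$
$\qquad + (-1)^k f({}_1b_{k-1}, b_kb_{k+1}) - (-1)^k f({}_1b_k) \cdot b_{k+1}]$
$= a_{1,k+1} \times \delta f({}_1b_{k+1}) = \{\phi(\delta f)\}({}_1S_{k+1})$,
proving our assertion for k > 0.

For k = 0: $\{\delta(\phi m)\}(S) = S \cdot \phi m - \phi m \cdot S = (a \times b) \cdot (1 \times m) - (1 \times m) \cdot (a \times b)$
$= a \times \delta m(b) = \{\phi(\delta m)\}(S)$, proving our assertion for k = 0.

Now consider the mapping $f \to \phi f$ mod $B^k(A \times B, P)$. Since $\phi\delta = \delta\phi$,
this mapping carries cocycles into cocycles and coboundaries into coboundaries.
Thus all that remains to be proved is that the kernel of this mapping consists
only of $B^k(B, M)$, in other words that if $\phi f = \delta g$, then there exists $h \in C^{k-1}(B, M)$
such that $f = \delta h$.

Let $1 = u_0, u_1, \cdots$ be a basis for A. We define a projection $\pi$ of
$A \times M$ into $1 \times M$ as follows. If $c \in A \times M$ and $c = \sum u_i \times m_i$, let
$\pi c = 1 \times m_0$. Now, supposing $\phi f = \delta g$, let $1 \times h = \pi g$. Then

$\phi f({}_1b_k) = 1 \times f({}_1b_k) = \pi\, \delta g({}_1b_k) = 1 \times \delta h({}_1b_k)$.
Therefore $f = \delta h$, q.e.d.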




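COROLLARY 5.1.² $B \notin K_n \Rightarrow A \times B \notin K_n$, (B with or without identity,
$n \geq 0$).


COROLLARY 5.2. $A \in L_0$, $B \in L_n \Rightarrow A \times B \in L_n$, ($n \geq 0$).


Proof of Corollary 5.2. $A \times B \notin K_n$ follows from Cor. 5.1; by Theorem
5.1 we have $A \times B \in K_{n+1}$, q.e.d.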


We now define an algebraic “ cup product ” analogous to the cup product 
defined for groups by Eilenberg and MacLane. ([2], section 4). 


Definition 5.1. Let A, B be algebras with or without identities,
$f \in C^r(A, M)$, $g \in C^s(B, N)$, (r, s > 0).³ Consider the vector space underlying
$M \times N$ as an $A \times B$ module such that $(a \times b) \cdot (m \times n) = (a \cdot m) \times (b \cdot n)$,
$(m \times n) \cdot (a \times b) = (m \cdot a) \times (n \cdot b)$.

Then the cup product $f \cup g \in C^{r+s}(A \times B, M \times N)$ is defined as follows:

$\{f \cup g\}({}_1S_{r+s}) = [f({}_1a_r) \cdot a_{r+1,r+s}] \times [b_{1r} \cdot g({}_{r+1}b_{r+s})]$.


It may be verified by straightforward computation that the cup product
is associative, and we have


PROPOSITION 5.2. $\delta(f \cup g) = \delta f \cup g + (-1)^r f \cup \delta g$.


As a consequence of the associativity of the cup product we may extend 
Definition 5.1 inductively to any finite sequence of cochains; we note also 
that Proposition 5.2 implies that the cup product of cocycles is a cocycle 
and that the cup product of a cocycle and a coboundary (in either order) 
is a coboundary. 


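LEMMA 5.1. Let $f \in Z^r(A, M)$, $g \in Z^s(B, N)$, $A \times B \in K_{r+s}$, (r, s > 0),
and suppose the identity elements of A, B act as identity operators on M, N
respectively, and that f is "normalized," i.e. $1 \leq i \leq r$, $a_i = 1 \Rightarrow f({}_1a_r) = 0$.

Then given sequences $\alpha = {}_1a_r \in A$, $\beta = {}_1b_s \in B$, there exist cochains
$f_\alpha \in C^{s-1}(B, M \times N)$ and $f_\beta \in C^{r-1}(A, M \times N)$ such that


$f({}_1a_r) \times g({}_1b_s) = \delta f_\beta({}_1a_r) + \delta f_\alpha({}_1b_s)$.


Proof. Let $\bar h = f \cup g$. By the remark preceding this lemma, $\bar h$ is a
cocycle; hence, since $A \times B \in K_{r+s}$, $\bar h$ is a coboundary. Therefore there exists
$h \in C^{r+s-1}(A \times B, M \times N)$ such that $\bar h = \delta h$. Now consider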


² For algebras A, B both with identities, this result is derived in Hochschild [5]
as a consequence of Theorem 7, ibid.
³ Although we shall not find it necessary to do so in the sequel, Def. 5.1 may be
extended to include the cases r = 0, s = 0 also. In fact the cochain $\phi f$ of Prop. 5.1
may be considered as a cup product in which r = 0, s > 0.




(I) 
h( sar, ibs) == ° 2dr, 10s) dr X b, obs) 


The arguments of $\bar h$ in the first column of (I) are to consist of the
(r + s)!/r!s! permutations of the sequence $({}_1a_r, {}_1b_s)$ which do not alter the
order of the a's or the order of the b's. The signs at the extreme left of the
rows of (I) are + or - according as the argument of $\bar h$ is an even or an
odd permutation of $({}_1a_r, {}_1b_s)$.

Now we define fg =h(1b,) for r—=1 and fg = (—1)"h(.a,) for s=1. 
For r,s > 1 we let = ,0,) —- ++, where the terms on the 
right are of the form + htiBaies)s with the sequences (15,451) consisting of 
all permutations of (14,1,1b,) which do not alter the order of the a’s or the 
order of the b’s; the sign preceding h is to be + or — according as the 
argument of h is an even or an odd permutation of (14,1,15,). Finally, let 
fa = (—1)"[h 1be-1)—- with a similar convention as to the argu- 


ments of h. 
We assert that the sum of the right hand sides of the equations in (I) is
$\delta f_\beta({}_1a_r) + \delta f_\alpha({}_1b_s)$. For firstly, if a summand on the right has an argument
in which a term of the type $a_i \times b_j$ appears, then that summand appears
precisely twice on the right and with opposite signs, so that all such summands
disappear in adding the rows of (I); secondly, the remaining terms each
occur precisely once and are clearly the terms of the expansions of $\delta f_\alpha({}_1b_s)$
and $\delta f_\beta({}_1a_r)$.

Finally, from the hypothesis of the lemma and the definition of $\bar h$, it
follows that $\bar h({}_1a_r, {}_1b_s) = f({}_1a_r) \times g({}_1b_s)$, while all the other terms on the
left hand sides of the equations in (I) are zero. The sum of the left hand
sides of the equations in (I) will then be $f({}_1a_r) \times g({}_1b_s)$, which completes
the proof of the lemma.


LEMMA 5.2. If $A \notin K_1$, then there exist an A-module M, $f \in Z^1(A, M)$
and $a_0 \in A$ such that the identity of A acts as an identity operator on M,
$f(a_0) \neq 0$ and $m \in M \Rightarrow a_0 \cdot m = m \cdot a_0$.


Proof. Case 1. $R \neq 0$ is the radical of A and A/R is separable. Then
A = A/R + R as a supplementary sum, i.e. each element $a \in A$ may be
uniquely expressed as $a = a' + r_a$, where $a' \in A/R$ and $r_a \in R$. Let r be the



projection of A onto R such that $r(a) = r_a$, let $M = R/R^2$, denote $r/R^2$ by $\bar r$,
where $r \in R$, and define $a \cdot \bar r = \overline{ar}$, $\bar r \cdot a = \overline{ra}$, $f(a) = \bar r_a$.

Then $1 \cdot \bar r = \bar r \cdot 1 = \bar r$; if we let $a_0$ be any element of $R - R^2$ then
$f(a_0) = \bar a_0 \neq 0$; $\delta f(a'_1 + r_1, a'_2 + r_2) = \overline{a'_1 r_2 + r_1 r_2} - \overline{a'_1 r_2 + r_1 a'_2 + r_1 r_2} + \overline{r_1 a'_2 + r_1 r_2}$
$= \overline{r_1 r_2} = 0$, so that $f \in Z^1(A, M)$; finally, $a_0 \cdot \bar r = \bar r \cdot a_0 = 0$. Thus all the
requirements of the lemma are satisfied.


Case 2. R = 0.


In this case A is semisimple and inseparable. Let $A_1$ be a simple
component of A of dimension m over its center C. It is shown in Hochschild [4],
Lemma 4.1 that we may make the matrix algebra $C_m$ into an $A_1$-module M such
that the identity of $A_1$ acts as an identity operator on M, and such that there
exist $a_0 \in C$ and $f \in Z^1(A_1, M)$ with $f(a_0) \neq 0$ and $a_0 \cdot m = m \cdot a_0$ for all $m \in M$.
By allowing the other simple components of A to annihilate M and defining f
to be zero on the other simple components of A, we make M into an A-module
and extend f to A. The requirements of the lemma are then satisfied.


Case 3. $R \neq 0$ and A/R is inseparable.


Since A/R is semisimple, there exists by Case 2 an A/R module $\bar M$ and
a cocycle $\bar f \in Z^1(A/R, \bar M)$ satisfying the requirements of the lemma, with
A/R, $\bar M$, $\bar f$ replacing A, M, f respectively. Letting $\bar a$ represent a/R, we may
achieve the desired result by defining the vector space underlying M to
coincide with that underlying $\bar M$ and for $m \in M$, $a \in A$ defining $a \cdot m = \bar a \cdot m$,
$m \cdot a = m \cdot \bar a$, $f(a) = \bar f(\bar a)$.


THEOREM 5.2. $A \notin K_1$, $B \notin K_n \Rightarrow A \times B \notin K_{n+1}$, (n > 0).


Proof. Let M, f, $a_0$ be as in Lemma 5.2, $g \in Z^n(B, N) - B^n(B, N)$,
${}_1b_n \in B$, and suppose $A \times B \in K_{n+1}$. Then by Lemma 5.1 there exist cochains
$g_\alpha \in C^{n-1}(B, M \times N)$, $f_\beta \in C^0(A, M \times N)$ such that $f(a_0) \times g({}_1b_n) = \delta f_\beta(a_0)$
$+ \delta g_\alpha({}_1b_n)$.


We may assume (cf. Hochschild [5], section 1) that the identity element
of B acts as an identity operator on N. Consequently Lemma 5.1 implies
that $\delta f_\beta(a_0) = 0$. Now if we let $f(a_0) \times h({}_1b_{n-1})$ be a projection of $g_\alpha({}_1b_{n-1})$
into $f(a_0) \times N$, we have $g = \delta h$, a contradiction which proves the theorem.


COROLLARY. $A \in L_1$, $B \in L_n \Rightarrow A \times B \in L_{n+1}$, ($n \geq 0$).


6. The existence of algebras in L,, (n= 0). 


We now prove that none of the classes $L_n$ is null. $L_0$ is not null, of
course, since $L_0 = K_1$ is the class of algebras separable over F. As for $L_1$,
it is proved by Hochschild ([5], section 9) that $H' \in K_2$ where H' is an algebra
with basis a, r such that a is a left identity and r a left annihilator. Since
every separable algebra has an identity, it follows that $H' \notin K_1$. Let H be the
algebra formed by adjoining an identity 1 to H'. Then $H \in L_1$, since adjoining
an identity to an algebra does not affect its K-class (cf. Hochschild [5],
section 2).


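PROPOSITION 6.1. The n-fold Kronecker product $H_1 \times \cdots \times H_n$, where $H_1 = \cdots = H_n = H$, does not belong to $K_n$.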


Proof. Let R (with basis r) denote the radical of H. Let Rk, — R, 


$f \in C^1(H, R)$ such that f(1) = f(a) = 0, f(r) = r. Then it is easily verified
(either directly or from Lemma 5.2, Case 1) that $f \in Z^1(H, R)$.
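
The direct verification mentioned here is short enough to carry out mechanically. The sketch below is an illustration under stated assumptions, not part of the paper: H is encoded by a multiplication table on the basis 1, a, r (with a a left identity and r a left annihilator of H'), and R is assumed to be a two-sided H-module through multiplication in H, so that an element c r of R is stored as the scalar c. The table and helper names are hypothetical.

```python
ONE, A, R = 0, 1, 2  # basis of H: the adjoined identity, a, and r

# Products of basis elements: a is a left identity and r a left annihilator
# of H' (so a*a = a, a*r = r, r*a = 0, r*r = 0); None stands for zero.
PRODUCT = {
    (ONE, ONE): ONE, (ONE, A): A, (ONE, R): R,
    (A, ONE): A, (A, A): A, (A, R): R,
    (R, ONE): R, (R, A): None, (R, R): None,
}

# Action on the one-dimensional module spanned by r, read off from PRODUCT:
LEFT = {ONE: 1, A: 1, R: 0}    # x . r = LEFT[x] * r
RIGHT = {ONE: 1, A: 0, R: 0}   # r . x = RIGHT[x] * r

BASIS = (ONE, A, R)


def delta1(f, x, y):
    """(delta f)(x, y) = x . f(y) - f(xy) + f(x) . y, as a multiple of r."""
    xy = PRODUCT[(x, y)]
    f_xy = 0 if xy is None else f[xy]
    return LEFT[x] * f[y] - f_xy + f[x] * RIGHT[y]


def delta0(c, x):
    """Coboundary of the 0-cochain c*r at x: x . (c r) - (c r) . x."""
    return (LEFT[x] - RIGHT[x]) * c


# f(1) = f(a) = 0, f(r) = r
f = {ONE: 0, A: 0, R: 1}

is_cocycle = all(delta1(f, x, y) == 0 for x in BASIS for y in BASIS)
# Every coboundary of a 0-cochain vanishes at r, while f(r) = r is not zero,
# so f cannot be a coboundary.
coboundaries_vanish_at_r = delta0(1, R) == 0

print(is_cocycle, coboundaries_vanish_at_r)
```

Under these assumptions f is a cocycle that is not a coboundary, which agrees with the statement above that H does not belong to $K_1$.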


Now let g be the cup product fn, where fi, =f. 
Then by the remark following Prop. 5.2, g is a cocycle. If for hye H we 
denote hi. X- X hin by Ui and [have Ani] by 
V, then g(,U,) Vi XK Va. 

In particular, if $h_{ii} = r$ and $h_{ij} = 1$ for $i \neq j$, then we denote $U_i$ by $W_i$.
Now, supposing that $g = \delta h$, consider the following set of equations (for
convenience we let $W = W_1W_2 = W_2W_1$):

(11) 
g(W,, We, sWn) Wi h(We, sWn)—h(W, sWn) +: + (—1)"h(Wi, We, sWa-1)* Wa 

—g(W., sWn) =— We:h(Wi, sWn) sWn) -—(—1)"h(We, Wi, We 


The arguments of g in the first column of (II) are to consist of the n!
permutations of the terms of the sequence $({}_1W_n)$. The signs at the extreme
left of the rows of (II) are to be + or - according as the argument of g
is an even or an odd permutation of $({}_1W_n)$.

We now note that each term in the first and last columns on the right
hand side of the equations in (II) is zero, since each of these terms is equal
to a Kronecker product with a factor in $R^2 = 0$. Secondly, note that each
term in the columns between those just mentioned occurs precisely twice in
(II), but with opposite signs. The sum of the right hand terms of (II) is
therefore zero. On the left hand side of the equations of (II), it is easily
verified that all terms are zero except $g({}_1W_n) = r \times r \times \cdots \times r$.

We are therefore led by the assumption $g = \delta h$ to the contradiction
$r \times r \times \cdots \times r = 0$. Thus g is a cocycle but not a coboundary; hence
$H_1 \times \cdots \times H_n \notin K_n$.


THEOREM 6.1. $L_n$ is not null, ($n \geq 0$).


Proof. We have already noted that $L_0 = K_1$ is not null. For n > 0, as
an immediate consequence of Theorem 5.1 and Prop. 6.1, it follows that the
algebra $H_1 \times \cdots \times H_n \in L_n$, where $H_1 = \cdots = H_n = H$.


UNIVERSITY OF MASSACHUSETTS. 


BIBLIOGRAPHY 


[1] S. Eilenberg, "Topological methods in abstract algebra," Bulletin of the American
Mathematical Society, vol. 55 (1949), pp. 3-37.
[2] S. Eilenberg and S. MacLane, "Cohomology theory in abstract groups. I," Annals of
Mathematics, vol. 48 (1947), pp. 51-78.
[3] S. Eilenberg and S. MacLane, "Cohomology theory in abstract groups. II," ibid., vol. 48 (1947), pp. 326-341.
[4] G. Hochschild, "On the cohomology groups of an associative algebra," ibid., vol. 46
(1945), pp. 58-67.
[5] G. Hochschild, "On the cohomology theory for associative algebras," ibid., vol. 47 (1946),
pp. 568-579.
[6] G. Hochschild, "Cohomology and representations of associative algebras," Duke
Mathematical Journal, vol. 14 (1947), pp. 921-948.


ON RELATED PERIODIC MAPS.* 


By E. E. Fioyp. 


1. Introduction. Consider a class of periodic maps defined on a
topological space X. We are concerned with special cases of the following problem.
Suppose the maps of the class are all related in some specified fashion. Are
there, then, any implied relationships between the fixed point sets of the maps
of the class?

A notable example of a problem of this sort has been solved recently
by S. D. Liao [5]. If X is a finite dimensional compact Hausdorff space
which has the homology groups of an n-sphere over the group $I_p$ of integers
mod p with p prime, and if T is periodic of period p on X, then, as P. A. Smith
has proved ([8], p. 366), the fixed point set L has the homology groups of an
r-sphere for some $-1 \leq r \leq n$. Liao settled a problem proposed by Smith
by proving that if X also has finitely generated integral cohomology groups,
then n - r is even or odd according as T is orientation preserving or orientation
reversing.

In section 1, we generalize Liao's result by proving that if X is a finite
dimensional compact Hausdorff space with finitely generated integral
cohomology groups, and if T is periodic of prime power period $p^a$ on X, then
the Lefschetz fixed point number of T is equal to the Euler characteristic of
L (defined using $I_p$ as coefficient group). We also extend a result of Smith
([9], p. 162) concerning the non-existence of certain types of periodic maps
of arbitrarily large period on n-manifolds with negative Euler characteristic.
The methods of this section depend heavily on recent results of Liao [5] and
of the author [4] which in turn are based on the special homology groups of
Smith [8].

In section 2, we consider a periodic map T of prime power period $q^a$
and then consider the class of all periodic maps $T_1$ of the same period which
are "sufficiently close" to T. Under these circumstances, we prove that the
fixed point set $L_1$ of $T_1$ is close to L in the sense of Begle's metric [1]
induced by the regular convergence introduced by Whyburn [11].

The author has read a pre-publication copy of Mr. Liao’s paper [5], and 
wishes to thank Mr. Liao for that privilege. 


* Received August 24, 1951; revised October 25, 1951. 




2. The Lefschetz fixed point number of T. A periodic map on a space 
X generates a periodic linear isomorphism on the rational homology groups 
of X. We require later in the section an analysis of the latter. We dispose 
of this first, using a procedure similar to one used by Smith ([9], pp. 161- 
162) for a similar purpose. 

Suppose V is a finite dimensional vector space over the rationals R.
If W is a subspace of V, let dW denote the dimension of W. Let T be a
linear transformation on V with T^p = identity. There are associated with T
the linear transformations σ = 1 + T + ··· + T^{p-1} and τ = 1 - T. Clearly
στ = τσ = 0. We use the following preliminary remark (cf. [5], 4.11).


(2.1) Image σ = kernel τ.


If m is a matrix representation of T, then we call its characteristic equation
f(t) the characteristic equation of T. The characteristic roots of T are
p-th roots of unity, for if |m - λI| = 0, then 0 = |m^p - λ^p I| = (1 - λ^p)^n,
where n = dV. Moreover, if no T^i, 0 < i < p, has non-zero fixed points, then
every characteristic root λ is a primitive p-th root of unity. For if λ^i = 1, then
|m^i - I| = 0. Hence there exists x ≠ 0 with T^i(x) = x.
But then i = p, so λ is a primitive p-th root.

Since f(t) has rational coefficients and all its roots are p-th roots of unity,
f(t) = f_{s_1}(t) ··· f_{s_k}(t), where f_{s_i}(t) is the cyclotomic equation of
degree φ(s_i), φ being Euler’s φ-function, whose roots are the primitive s_i-th
roots of unity. Moreover it may be seen that s_i divides p. In the following,
we use V(S) to represent the fixed point set of the linear transformation S.


(2.2) Let T be a linear transformation on the finite-dimensional rational
vector space V with T^p = identity. Then

(a) if p is prime, there exists a non-negative integer k with dV = dV(T)
+ k(p - 1); moreover, trace T = dV(T) - k;

(b) if p = q^a where q is prime and a > 1, then trace T = trace (T | V(T^{q^{a-1}})).

Proof. To prove (a), decompose V into V(T) ⊕ V_1, where T(V_1) = V_1
(cf. the proof of (2.1)). The characteristic equation of T | V_1 has as roots
only primitive p-th roots of unity. Hence its characteristic equation is of the
form (f_p(t))^k. Since the degree of f_p(t) is p - 1, dV = dV(T) + k(p - 1).
The trace of T | V_1 is then k(a_1 + ··· + a_{p-1}), where the a’s are the
primitive p-th roots of unity. Hence the trace of T | V_1 is -k. So (a)
follows.
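As an illustration only, and not part of the original argument, statement (a) of (2.2) can be checked on a permutation matrix. The sketch below assumes T permutes a basis with a chosen number of fixed basis vectors and of p-cycles; the function names and the choice of example are hypothetical.

```python
from fractions import Fraction


def permutation_matrix(perm):
    """Matrix of the linear map sending basis vector e_i to e_perm[i]."""
    n = len(perm)
    m = [[Fraction(0)] * n for _ in range(n)]
    for i, j in enumerate(perm):
        m[j][i] = Fraction(1)
    return m


def rank(matrix):
    """Rank over the rationals, by Gaussian elimination."""
    m = [row[:] for row in matrix]
    rows, cols = len(m), len(m[0])
    r = 0
    for c in range(cols):
        pivot = next((i for i in range(r, rows) if m[i][c] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, rows):
            factor = m[i][c] / m[r][c]
            m[i] = [a - factor * b for a, b in zip(m[i], m[r])]
        r += 1
    return r


def check_2_2a(p, fixed, cycles):
    """Return (dV, dV(T), k, trace T) for an assumed permutation example.

    T fixes `fixed` basis vectors and permutes the rest in `cycles`
    disjoint cycles of prime length p, so that T^p = identity.
    """
    perm = list(range(fixed))
    for c in range(cycles):
        base = fixed + c * p
        perm += [base + (i + 1) % p for i in range(p)]
    m = permutation_matrix(perm)
    n = len(perm)
    t_minus_id = [[m[i][j] - (1 if i == j else 0) for j in range(n)]
                  for i in range(n)]
    d_fixed = n - rank(t_minus_id)      # dV(T) = dimension of kernel(T - 1)
    trace = sum(m[i][i] for i in range(n))
    k = (n - d_fixed) // (p - 1)        # k as defined in (2.2)(a)
    return n, d_fixed, k, trace
```

For p = 5 with two fixed basis vectors and three 5-cycles, the function returns the four quantities entering dV = dV(T) + k(p - 1) and trace T = dV(T) - k, so both identities can be compared directly.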




To prove (b), decompose V into V(T^{q^{a-1}}) ⊕ V_1, where T(V_1) = V_1.
Then the characteristic equation of T | V_1 is of the form (f_p(t))^k, and the
trace of T | V_1 is k(a_1 + ··· + a_{φ(p)}), where the a_i are the primitive p-th
roots of unity. It may then be seen that the trace of T | V_1 is 0. So (b)
follows.
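The two trace computations above rest on the value of the sum of the primitive p-th roots of unity: -1 when p is prime, and 0 when p = q^a with a > 1. The following numerical sketch, which is an illustration and not part of the paper, evaluates that sum in floating point and rounds it; the function name is hypothetical.

```python
import cmath
from math import gcd


def primitive_root_sum(n):
    """Sum of the primitive n-th roots of unity, rounded to an integer.

    The exact value is an integer, so rounding the real part of the
    floating point sum recovers it for moderate n.
    """
    total = sum(cmath.exp(2j * cmath.pi * k / n)
                for k in range(1, n + 1) if gcd(k, n) == 1)
    return round(total.real)
```

Evaluating the function at a prime such as 5 or 7, and at proper prime powers such as 4, 8, 9 or 27, reproduces the two cases used in the proofs of (a) and (b).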

Suppose now that X is a compact Hausdorff space, and let T be a map
of X into X. Let H_n(X; F) denote the Čech homology group of X over
the field F, and T_{*n} the induced linear transformation on H_n(X; F). Define
χ(X; F) = Σ(-1)^i dH_i(X; F), in case the right hand side is defined and
finite, and call χ(X; F) the Euler characteristic of X over F. Also define
a(T; F) = Σ(-1)^i trace T_{*i}, in case χ(X; F) exists, and call a(T; F) the
Lefschetz fixed point number of T over F ([6], p. 319).

We suppose now that X is a finite dimensional compact Hausdorff space
with finitely generated integral Čech cohomology groups. Let T denote a
periodic map on X of prime period p. Let L denote the fixed point set of T,
and Y the orbit decomposition space of T. We have occasion to use the
following recent results. Of these, (2.3), (2.4), and (2.5) are due to Liao
[5], and (2.6) to the author [4].


(2.3) (Liao). Y has finitely generated cohomology groups. 


Liao ([5], Theorem 5.5) has given a proof for this in case X has the
groups of an n-sphere over I_p. The proof used the extra assumption only to
insure that L has finitely generated groups over I_p. Since this is true in
the general case ([4], Theorem 4.2), the proof then holds.


(2.4) (Liao). χ(X; I_p) = χ(X; R), χ(Y; I_p) = χ(Y; R) ([5], Theorem 2.8).


(2.5) If η: X → Y denotes the orbit decomposition map, then η_* maps
[x | x ∈ H_n(X; R), T_{*n}(x) = x] isomorphically onto H_n(Y; R).


This result is more or less implicit in the work of Liao (cf. [5], 4. 3, 
4.11, 4.13). Because of its importance here, we outline, using the notation 
of [5; §4], a direct argument. For each 6,,¢C,(0(Ky,T,);R), let 
sy € C;(K)y, R) be such that mr (asr) == De. Define Ex = It may 
be verified that is uniquely defined, that = & 0, and that myéy = 
Moreover, and = Hence there is induced 
with = pr,reH,(0(X,T);R), 
&y(z) =o(x),xeH,(X;R). Since is an isomorphism onto, maps 
image isomorphically onto H,(0(X,7T);R). Since is onto and =<, 
we have image = image σ. But by (2.1) image σ = kernel τ. The assertion
follows.




(2.6) χ(X; I_p) + (p - 1)χ(L; I_p) = pχ(Y; I_p) [4].

We are now in a position to prove the main theorem of this section.


(2.7) THEOREM. Let X be a finite dimensional compact Hausdorff
space with finitely generated integral Čech cohomology groups. Let T be a
periodic map on X of period q^a, q prime. Let L be the fixed point set of T.
Then a(T; R) = χ(L; I_q).


Proof. We prove the theorem first for a = 1, writing p = q. Consider
T_{*n}: H_n(X; R) → H_n(X; R). According to (2.5), the fixed point set of T_{*n}
is isomorphic to H_n(Y; R). Hence by (2.2),


dH_n(X; R) = dH_n(Y; R) + [dH_n(Y; R) - trace T_{*n}](p - 1)


so that dH_n(X; R) + (p - 1) trace T_{*n} = p dH_n(Y; R). Taking the alternating
sum, we get χ(X; R) + (p - 1)a(T; R) = pχ(Y; R). Using (2.4)
and comparing with (2.6), we get a(T; R) = χ(L; I_p).


Suppose a > 1 and suppose the theorem has been proven for a - 1.
Consider T_1 = T^{q^{a-1}}. Let Y_1 denote the orbit space of the map T_1 on X,
and f: X → Y_1 the natural decomposition map. Define a map S: Y_1 → Y_1
by Sf = fT. Then S is of period q^{a-1} on Y_1. Also, by (2.3), Y_1 has finitely
generated integral cohomology groups. Hence, by the induction hypothesis,
a(S; R) = χ(L'; I_q), where L' is the fixed point set of S.

We point out that L and L' are homeomorphic. Clearly, f(L) ⊂ L' and
f is 1-1 on L. We prove that f(L) = L'. Let y ∈ L', where y = f(x), x ∈ X.
Then f(x) = Sf(x) = fT(x), so T(x) = T_1^k(x) for some k. But then
kq^{a-1} - 1 is a period for x, so kq^{a-1} - 1 divides q^a. Hence k = 0, so that
T(x) = x. So χ(L'; I_q) = χ(L; I_q).

Finally, let F_n = [x | x ∈ H_n(X; R), T_{1*n}(x) = x].
Then, by (2.5), f_* maps F_n isomorphically onto H_n(Y_1; R). Moreover, since
S_* f_* = f_* T_*, we have trace S_{*n} = trace(T_{*n} | F_n). But by
(2.2), trace(T_{*n} | F_n) = trace(T_{*n} | H_n(X; R)). It follows that a(S; R) = a(T; R)
and the theorem follows.
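For orientation, (2.6) and (2.7) can be checked on three standard periodic maps of the 2-sphere. The sketch below is an illustration only: the traces on homology and the Betti numbers of the fixed point sets and orbit spaces are entered by hand as assumptions (well-known values for a rotation, a reflection and the antipodal map), and the names are hypothetical.

```python
def alternating_sum(values):
    """Return the sum of (-1)^i * values[i]."""
    return sum((-1) ** i * v for i, v in enumerate(values))


# Betti numbers of the 2-sphere X (the same over R and over I_p).
SPHERE_BETTI = [1, 0, 1]

# Assumed data for each map T of prime period p on the 2-sphere:
#   traces      : trace of T_* on H_0, H_1, H_2 over the rationals
#   fixed_betti : Betti numbers over I_p of the fixed point set L
#   orbit_betti : Betti numbers over I_p of the orbit space Y
EXAMPLES = {
    # L is the two poles, Y is again a 2-sphere.
    "rotation of period 3": dict(p=3, traces=[1, 0, 1],
                                 fixed_betti=[2], orbit_betti=[1, 0, 1]),
    # L is the equator (a circle), Y is a closed disk.
    "reflection in the equator": dict(p=2, traces=[1, 0, -1],
                                      fixed_betti=[1, 1], orbit_betti=[1]),
    # L is empty, Y is the projective plane (coefficients I_2).
    "antipodal map": dict(p=2, traces=[1, 0, -1],
                          fixed_betti=[], orbit_betti=[1, 1, 1]),
}


def check(example):
    """Report whether (2.6) and (2.7) hold for the given data."""
    p = example["p"]
    chi_x = alternating_sum(SPHERE_BETTI)
    chi_l = alternating_sum(example["fixed_betti"])
    chi_y = alternating_sum(example["orbit_betti"])
    lefschetz = alternating_sum(example["traces"])
    return {
        "2.6": chi_x + (p - 1) * chi_l == p * chi_y,
        "2.7": lefschetz == chi_l,
    }
```

Calling `check` on each entry of `EXAMPLES` compares both sides of the two formulas for that map.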

We now turn to some results concerned with properties of periodic maps 
of large period. 


(2.8) (Smith). Let V be a finite dimensional rational vector space.
There exists a positive integer r associated with V so that if T is any linear
transformation on V with T^p = identity where p > r, then there exists
1 ≤ j ≤ r, j | p, with T^j = identity.




Proof. We shall outline the proof ([9], pp. 161-162). Suppose
p = p_1^{a_1} p_2^{a_2} ··· p_n^{a_n}, where the p_i are primes with p_1 < p_2 < ··· < p_n.
Define


Φ(p) = Σ_{i ≥ 1} φ(p_i^{a_i}) if p_1 ≠ 2 or a_1 ≠ 1; Φ(p) = Σ_{i ≥ 2} φ(p_i^{a_i}) otherwise.


Then Φ(p) → ∞ as p → ∞. We point out that if Φ(p) > dV, then there exists
1 ≤ j < p with T^j = identity. For suppose this is not the case. Using the
notation preceding (2.2), we have f(t) = f_{s_1}(t) ··· f_{s_k}(t), where s_j | p.
Now each p_i^{a_i} divides some s_j. For if not, each s_j divides p/p_i = q, so that
T^q = identity. But if each p_i^{a_i} divides some s_j, it may be checked that
dV = Σφ(s_j) ≥ Φ(p). Hence Φ(p) ≤ dV, and the assertion follows.


(2.9) As a consequence of (2.8), let X be a compact Hausdorff space
with each H_n(X; R) of finite dimension and = 0 for all but a finite number
of n’s. There exists a positive integer r so that if T is any periodic map on
X, then T_{*n}^j: H_n(X; R) → H_n(X; R) is, for some 1 ≤ j ≤ r, the identity
for all n.


We denote the least such r by r(X). 


(2.10) THEOREM. Let X be a finite dimensional compact Hausdorff
space with finitely generated integral cohomology groups. Let T be a periodic
map on X of period p > r(X). There exists 1 ≤ i < p such that p/i = q
is a prime, and such that if L_i denotes the fixed point set of T^i, then
χ(X; R) = χ(L_i; I_q).


Proof. There exists, by (2.9), j ≤ r with T_{*n}^j = identity for all n.
Suppose p = j·k·q, where k and q are positive integers with q prime. Let
i = jk. Then T_{*n}^i = identity for all n. Hence by (2.7), a(T^i; R)
= χ(X; R) = χ(L_i; I_q).


The following is an extension of a result of Smith [9; 162]. It also 
generalizes the well-known theorem [11] that the periodic maps on a compact 
2-manifold with negative Euler characteristic have uniformly bounded periods. 
It does not, however, provide the upper bound known for that case. 


(2.11) THEOREM. Let X be a compact manifold with χ(X; R) < 0.
Suppose T is a periodic map on X of period p, and such that if 1 ≤ j < p,
then the dimension of the fixed point set of T^j is ≤ 1. Then p ≤ r(X).


Proof. Suppose p > r(X). Let i be the number given by (2.10).
Then χ(X; R) = χ(L_i; I_q) < 0. But dim L_i ≤ 1, so that by a result of
Smith ([10], p. 704), L_i is the union of a disjoint collection of points and
simple closed curves. Hence χ(L_i; I_q) ≥ 0, which is a contradiction.


(2.12) The above theorem is not true if the restriction on the dimension
of the fixed point set of T^j is removed.


As an example, let X be a 2-sphere, and let Y be a 2-manifold with
χ(Y) < 0. Then χ(X × Y) = χ(X)χ(Y) < 0. But since X admits
transformations of arbitrary period, so does X × Y.


3. Convergence properties. We begin section 3 by stating an important 
result due to Smith [7] which is the basis for the work of this section. The 
result is stated and proved in the proof of Theorems I, II in [7]. 


(3.1) (Smith). Let X be a locally compact n-dimensional Hausdorff
space, n < ∞, and let T be a periodic map on X of prime period p. Denote
by L the fixed point set of T. Suppose 0 ≠ A_0 ⊂ A_1 ⊂ ··· ⊂ A_m, m = pn + p,
is a sequence of compact subsets of X, with T(A_i) = A_i, and with every Čech
cycle in A_i over I_p bounding in A_{i+1}. Then L ∩ A_m ≠ 0 and every cycle in
L ∩ A_0 over I_p bounds in L ∩ A_m.


We use also the concept of regular convergence introduced by Whyburn
[12]. We shall phrase the definition in terms of Čech theory instead of
Vietoris theory; these are interchangeable, as follows from the full equivalence
of the two theories ([6], p. 277). Let X be a locally compact metric space,
and let G be an abelian group. Let [A_i] be a sequence of closed subsets of X,
with A_i converging to a closed subset A of X. If n is a non-negative integer,
then A_i converges n-regularly to A over G if and only if given x ∈ A and a
compact neighborhood U of x in X, there exists a closed neighborhood V of x
(in X) with V ⊂ U, and a positive integer I, so that every Čech cycle in
V ∩ A_i over G of dimension ≤ n bounds in U ∩ A_i for i > I. It may be
seen that X is lc^n (i. e., homologically locally connected over G in the
dimensions from 0 to n), if and only if the sequence X, X, ··· converges n-regularly
to X.

Let X and Y be metric spaces. Let A_i be a sequence of closed subsets
of X which converges to a subset A of X. Let f_i: A_i → Y, f: A → Y be
continuous. We shall say that f_i converges continuously to f if and only if
whenever x_i → x, x_i ∈ A_i, then f_i(x_i) → f(x). This specializes, in case A_i = A,
to the notion of continuous convergence introduced by Carathéodory ([2],
p. 58).




(3.2) THEOREM. Let X be a locally compact n-dimensional metric space,
n < ∞. Suppose [A_i] is a sequence of closed subsets of X converging
n-regularly over I_p, p prime, to the subset A of X. Let T_i be a continuous
periodic transformation of period p on A_i, such that [T_i] converges
continuously to the continuous function T on A. Then the fixed point set [F_i]
of T_i converges n-regularly over I_p to the fixed point set F of T.


Proof. The reader may verify that if x ∈ F and U is a neighborhood
of x (in X), then there exists a neighborhood V of x and a positive integer I
such that if i > I, then ∪_j T_i^j(V ∩ A_i) ⊂ U.


Let x ∈ F and let U be a compact neighborhood of x. There exists a
sequence U = U_{2m+1} ⊃ U_{2m} ⊃ ··· ⊃ U_0, m = pn + p, of compact
neighborhoods of x (in X) and a positive integer I, such that U_0 ∩ A_i ≠ 0 for i > I,
and (a) if i > I, then ∪_j T_i^j(U_k ∩ A_i) ⊂ U_{k+1} for k = 0, ···, 2m, and (b)
for i > I every cycle in U_k ∩ A_i over I_p bounds in U_{k+1} ∩ A_i.

For each 0 ≤ k ≤ m and each i > I define V_{ki} = ∪_j T_i^j(U_{2k} ∩ A_i).
Then V_{ki} ⊂ A_i, and T_i(V_{ki}) = V_{ki}. Moreover, since V_{k+1,i} ⊃ U_{2k+2} ∩ A_i
and U_{2k+1} ∩ A_i ⊃ V_{ki}, every cycle in V_{ki} bounds in V_{k+1,i}. Hence we may
apply (3.1) to the sequence V_{0i} ⊂ V_{1i} ⊂ ··· ⊂ V_{mi}, and the transformation T_i.
It follows that V_{mi} ∩ F_i ≠ 0, and every cycle in V_{0i} ∩ F_i bounds in V_{mi} ∩ F_i.
Hence, for i > I every cycle in U_0 ∩ F_i bounds in U ∩ F_i, and U ∩ F_i ≠ 0.

To finish the proof, the reader has only to note that if x_{m_i} ∈ F_{m_i}, and
x_{m_i} → x, then x ∈ F. This follows easily from continuous convergence.


(3.3) COROLLARY. Let X be a locally compact n-dimensional metric
space, n < ∞, which is lc^n over I_p, p prime. Let [T_i] be a sequence of periodic
maps on X of common period p^a, which converges continuously to the
continuous map T. Then the fixed point set F_i of T_i converges n-regularly over
I_p to the fixed point set F of T.


Proof. The proof is a straightforward combination of (3.2) together
with a procedure used often by Smith for extending proofs from period p to
period p^a ([8], p. 367).


(3.4) COROLLARY. Let the hypotheses be those of (3.3) and suppose in
addition that X is compact. Then there exists I such that for i > I, we have
H_j(F_i; I_p) ≈ H_j(F; I_p) for all j. In particular, suppose X an n-sphere.
Then there exists an integer r so that F_i, i > I, and F are all homological
r-spheres over I_p.


Proof. This follows from a theorem of Begle [1].


(3.5) COROLLARY. Let X be an n-dimensional compact metric space,
n < ∞, which is lc^n over I_p, p prime. Let T be a periodic map on X of
period p^a with fixed point set L. There is an ε > 0 such that if T_1 is periodic
on X of period p^a, ρ(T(x), T_1(x)) < ε for all x ∈ X, and L_1 denotes the fixed
point set of T_1, then H_j(L; I_p) ≈ H_j(L_1; I_p) for all j.


UNIVERSITY OF VIRGINIA. 


BIBLIOGRAPHY. 


[1] E. G. Begle, “Regular convergence,” Duke Mathematical Journal, vol. 11 (1944),
pp. 441-450.
[2] C. Carathéodory, Conformal representation, Cambridge, 1932.
[3] E. E. Floyd, “Examples of fixed point sets of periodic maps,” Annals of
Mathematics, vol. 55 (1952), pp. 167-171.
[4] E. E. Floyd, “On periodic maps and the Euler characteristics of associated spaces,”
Transactions of the American Mathematical Society, vol. 72 (1952), pp.
138-147.
[5] S. D. Liao, “A theorem on periodic transformations of homology spheres,” Annals
of Mathematics (to appear).
[6] S. Lefschetz, Algebraic topology, New York, 1942.
[7] P. A. Smith, “Fixed point theorems for periodic maps,” American Journal of
Mathematics, vol. 63 (1941), pp. 1-8.
[8] P. A. Smith, “Fixed points of periodic transformations,” Appendix B in [6].
[9] P. A. Smith, “Periodic and nearly periodic transformations,” Lectures in Topology,
Ann Arbor, 1941, pp. 159-190.
[10] P. A. Smith, “Transformations of finite period II,” Annals of Mathematics, vol. 39
(1938), pp. 127-164.
[11] F. Steiger, “Die maximalen Ordnungen periodischer topologischer Abbildungen
geschlossener Flächen in sich,” Commentarii Mathematici Helvetici, vol. 8
(1935), pp. 48-69.
[12] G. T. Whyburn, “On sequences and limiting sets,” Fundamenta Mathematicae,
vol. 25 (1935), pp. 408-426.




TOPOLOGY OF METRIC COMPLEXES.* 
By C. H. DOWKER.


The metric complexes (polyhedra) discussed in this paper are metric 
spaces with a cell decomposition and an affine structure for each cell. These 
complexes are subject to certain mild conditions (section 9, conditions a and b’) 
which, for example, ensure local connectedness. The complexes are not, how- 
ever, required to be finite or countable. They may be curved and they need 
not be locally finite. 

If a complex is star-finite, and if the closed cells are given their usual
topology, then the topology of the whole complex is uniquely determined.
However, if the complex is not star-finite, there is no longer a unique topology.
J. H. C. Whitehead has chosen for his topological polyhedra ([11], pages
315-321) the finest topology consistent with the usual topology for the closed cells.
This is a very convenient and useful topology, but with it all complexes except
star-finite ones become non-metrizable spaces.

In connection with his studies of local connectedness, S. Lefschetz ([9], 
Chapter I) has chosen two particular ways of giving a complex a metric. If 
the complex is not star-finite, the topology induced by each of these metrics 
is necessarily less fine than the Whitehead topology. In general, the two 
different Lefschetz metrics induce different topologies. 

In this paper, instead of choosing some particular metric, we take a 
somewhat axiomatic point of view and state conditions which should be satisfied 
by any metric complex. Our theorems are then shown to be consequences of 
these conditions. However, our method is not one of proof directly from the 
axioms. Instead, we use the method of comparing each metric complex with 
the corresponding Whitehead complex, that is, the same complex retopologized 
with the Whitehead topology. 

In the first chapter, we discuss affine complexes. These are sets which 
have a cell decomposition and an affine structure for each cell, but they have 
no topology. These affine complexes have homology and cohomology groups, 
and the theorem on invariance under subdivision holds. 

In the second chapter we add topology to the affine complex, and state
conditions on the topology in order that the complex may be called a topological
complex.


* Received August 10, 1950. 




In the third chapter we discuss Whitehead complexes, that is, affine com- 
plexes with the Whitehead topology. Since the method of investigating metric 
complexes is that of comparison with the corresponding Whitehead complexes, 
we give a rather complete resumé of the known theorems on Whitehead 
complexes. 

The fourth and main chapter contains the results on metric complexes. 
A metric complex is defined to be a topological complex whose topology is 
induced by a metric. Given a metric complex we construct (sections 10-13) 
a sequence of locally finite coverings of the complex by open sets, and using 
this sequence of coverings (sections 14-15) we construct a homotopy of the 
identity mapping of the complex. Then (section 16) by means of this homo- 
topy, we prove that each metric complex has the same homotopy type as the 
corresponding Whitehead complex. It follows that any two isomorphic metric 
complexes have the same homotopy type. In section 17 we discuss the 
mappings of a space in the nerve of a covering when this nerve is a metric 
complex. In section 18 we show the topological invariance of the homology 
and cohomology groups of metric complexes. 


I. Affine Complexes. 


1. Definition and properties of affine complexes. By a convex cell E
of a Euclidean space we mean an open bounded convex cell; its closure Ē is
called a closed convex cell.

Let a set X be the union of a family {e_α} of mutually disjoint subsets e_α
of X. Let each e_α be associated with a 1-1 transformation φ_α of some closed
convex cell Ē_α into X such that (i) φ_α maps the convex cell E_α onto e_α and
(ii) if E' is any face of E_α, then φ_α(E') is an e_β, and φ_α^{-1}φ_β is an affine
(linear) transformation of E_β onto E'. Then the set X, together with the
decomposition {e_α} and the family {φ_α} of transformations, is called an affine
complex.

Each of the subsets e_α with the linear structure given by the
transformation φ_α | E_α: E_α → e_α, is called a cell of the complex. The dimension of
the cell e_α is defined to be the dimension of E_α. Each cell of dimension zero
contains a single point which is called a vertex. If e_β = φ_α(E'), where E'
is a face of E_α, e_β is called a face of e_α; we write e_β ≤ e_α. If E' is a proper
face of E_α, e_β is called a proper face of e_α; we write e_β < e_α. By a finite affine
complex we mean one with only a finite number of cells. By a star-finite
affine complex we mean an affine complex which, for each cell e_β, has only a
finite number of cells e_α with e_β ≤ e_α.




If K is an affine complex, the cells e_α of K may be oriented by assigning
orientations to the corresponding convex cells E_α. If e_β is an (r-1)-dimensional
face of an r-cell e_α, the incidence number [e_α : e_β] is defined to be
the incidence number [E_α : E'], provided φ_α^{-1}φ_β: E_β → E' is orientation
preserving, and to be -[E_α : E'] otherwise. With such a definition of orientation
and incidence numbers, the affine complex becomes a closure finite oriented
cell complex ([8], page 89). Thus, given a topological abelian group G and
a non-negative integer p, we may define the p-dimensional cohomology group
H^p(K, G), and if G is discrete we may define the p-dimensional homology
group H_p(K, G).

The elements of the underlying set X of an affine complex K are called
the points of K. A closed cell ē_α of K (the closure of e_α) is defined to be the
image set φ_α(Ē_α) together with the transformation φ_α. If x and y are points
of ē_α, the closed segment [x, y] of ē_α is defined to be the image by φ_α of the
closed segment of Ē_α joining φ_α^{-1}(x) to φ_α^{-1}(y). If 0 ≤ t ≤ 1, the point
tx + (1-t)y which divides the segment from x to y in ē_α in the ratio t : 1-t
is defined to be the image by φ_α of the point dividing the segment from
φ_α^{-1}(x) to φ_α^{-1}(y) in the ratio t : 1-t. Similarly, a convex set in ē_α is
defined to be the image by φ_α of a convex set of Ē_α. If A is a subset of ē_α,
the convex hull A* of A in ē_α is the image by φ_α of the convex hull of φ_α^{-1}A
in Ē_α; thus A* is the least convex set of ē_α containing A.

Note that [x, y] may depend on ē_α as well as on x and y. However,
if e_β is a face of e_α, and if x and y are points of ē_β, then the segment [x, y]
in ē_β is the same as the segment [x, y] in ē_α, and the point tx + (1-t)y
in ē_β is the same as the point tx + (1-t)y in ē_α. In fact, since φ_α^{-1}φ_β:
Ē_β → Ē_α is affine, the affine structure of ē_β is that induced by the affine
structure of ē_α.


It is clear that the set ē_α is the union of all the faces of e_α; ē_α = ∪ e_β,
the union being taken over all e_β ≤ e_α.


The star St e_β of a cell e_β of K is defined to be the union of all cells e_α
such that e_β is a face of e_α; St e_β = ∪ e_α, the union being taken over all
e_α with e_β ≤ e_α. Note that e_α ⊂ ē_α, e_α ⊂ St e_α,
e_α = ē_α ∩ St e_α. The following three statements are equivalent: (i) e_β ≤ e_α,
(ii) ē_β ⊂ ē_α, (iii) St e_α ⊂ St e_β.

If x is a point of K, e(x) is defined to be the unique cell e_α containing x,
and ē(x) is defined to be the closure of e(x). The following five statements
are equivalent: (i) x ∈ ē(y), (ii) ē(x) ⊂ ē(y), (iii) e(x) ≤ e(y), (iv)
St e(y) ⊂ St e(x), (v) y ∈ St e(x).
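The equivalences between the face relation, inclusion of closed cells and reverse inclusion of stars depend only on the partial order of the cells. As an illustration that is not part of the paper, the sketch below models the cells of a single closed triangle by their vertex sets, an assumed encoding with hypothetical names, and treats a closed cell and a star as sets of cells.

```python
from itertools import combinations

# Cells of a closed triangle, each encoded by its set of vertices
# (an assumed combinatorial model: 3 vertices, 3 edges, one 2-cell).
VERTICES = ("a", "b", "c")
CELLS = [frozenset(s) for r in (1, 2, 3) for s in combinations(VERTICES, r)]


def is_face(f, e):
    """The face relation f <= e, here inclusion of vertex sets."""
    return f <= e


def closure(e):
    """The closed cell of e, as the set of all faces of e."""
    return {f for f in CELLS if is_face(f, e)}


def star(e):
    """The star of e, as the set of all cells having e as a face."""
    return {g for g in CELLS if is_face(e, g)}


def equivalences_hold():
    """Check (i) <=> (ii) <=> (iii) and e = closure(e) ∩ St e for all cells."""
    for f in CELLS:
        if closure(f) & star(f) != {f}:
            return False
        for e in CELLS:
            i = is_face(f, e)
            ii = closure(f) <= closure(e)
            iii = star(e) <= star(f)
            if not (i == ii == iii):
                return False
    return True
```

The same functions work unchanged for any finite family of vertex sets closed under taking non-empty subsets, so other small simplicial examples can be substituted for `CELLS`.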

Two affine complexes K_1 and K_2 are called isomorphic if there is a 1-1
order preserving correspondence between the set of cells of K_1 and the set of
cells of K_2. The correspondence is called an isomorphism. It can be shown
(see [1], page 127) that under an isomorphism, each cell corresponds to a
cell of the same dimension.

Suppose there is given a subcollection of cells of an affine complex K
such that if any cell e_α is in the subcollection, so is each face of e_α. Then
the union L of the cells e_α of the subcollection is a set of points decomposed
into cells e_α which are associated with transformations φ_α: Ē_α → L. In fact
L is an affine complex which is called a subcomplex of K. In particular, if e_α
is a cell of K, the set ē_α with the obvious cell decomposition is a subcomplex
of K. Also, the union of the closures of the cells contained in St e_β, with
the obvious cell decomposition, forms a subcomplex which we call St ē_β.

If K and L are affine complexes, the product set K × L can be
decomposed into cells e_αβ = e_α × e_β. For each e_αβ let Ē_α × Ē_β be the closed convex
cell which is the cartesian product of Ē_α and Ē_β, and let φ_αβ: Ē_α × Ē_β → K × L
be defined by φ_αβ(x, y) = (φ_α(x), φ_β(y)). It is easily verified
that K × L thus becomes an affine complex. This complex is called the
product complex of K and L.

An affine complex K is called simplicial if (i) for each e_α, E_α is a
simplex, and (ii) each non-empty intersection ē_α ∩ ē_β of two closed cells of
K is a closed cell ē_γ. For a discussion of simplicial affine complexes see
([9], § 4).


2. Subdivision. A subdivision of an affine complex K is a 1-1
transformation Sd: K → K' of K onto an affine complex K' such that (i) the
image of each cell of K consists of the union of a finite number of cells, and
(ii) the inverse transformation is linear on each closed cell of K'. We shall
also say that the affine complex K' is a subdivision of K.

By condition (i), for each cell e_α of K' there is a unique cell e_β of K
such that e_α ⊂ Sd e_β. Condition (ii) means that, if e_α and e_β have affine
structures given by φ_α: Ē_α → ē_α and φ_β: Ē_β → ē_β, then
φ_β^{-1} Sd^{-1} φ_α: Ē_α → Ē_β is linear.


(2.1) Isomorphic affine complexes have isomorphic simplicial
subdivisions.


Proof. The barycentric subdivision ([1], page 135) of an affine complex 
is a simplicial affine complex. Isomorphic affine complexes have isomorphic 
barycentric subdivisions. 


(2.2) If K and L are isomorphic simplicial affine complexes, there
exists a map f of K onto L which maps each cell of K onto a corresponding
cell of L, and which is linear on each closed cell.


Proof. The natural barycentric mapping ([9], p. 7; [1], p. 138, § 6) 
has the required properties. 


Given a subdivision Sd: K → K' of an affine complex K, let a chain
transformation ρ_p: C_p(K) → C_p(K') be defined as follows: If e_β^p is an
elementary p-chain of K, let ρ_p e_β^p = Σ ε_α e_α^p, where the summation is over all α such
that e_α^p is a p-cell of K', e_α^p ⊂ Sd e_β^p, and ε_α = 1 if φ_β^{-1} Sd^{-1} φ_α is orientation
preserving, ε_α = -1 if it is orientation reversing.

It is assumed known that, in the finite complex Sd ē_β, ρ∂e_β^p = ∂ρe_β^p.
Hence ρ∂ = ∂ρ: C_p(K) → C_{p-1}(K'); thus ρ is a chain mapping ([6], p. 411;
[8], p. 145). It is to be shown that ρ is a chain equivalence ([6], page 414).

If e_α ⊂ Sd e_β, let T e_α be the finite subcomplex ē_β of K. We define chain
transformations τ_p: C_p(K') → C_p(K) such that (i) τ_{p-1}∂ = ∂τ_p: C_p(K') → C_{p-1}(K),
and (ii) τ_p e_α^p is a chain of the subcomplex T e_α^p of K. For each elementary
0-chain e_α^0, τ e_α^0 is chosen to be any elementary 0-chain in T e_α^0. If e_α^1 is an
elementary 1-chain, τ∂e_α^1 is a bounding 0-cycle in T e_α^1, and we choose as τ e_α^1
any 1-chain in T e_α^1 bounded by τ∂e_α^1. If p > 1, assume that τ has been
defined for dimensions less than p. Then τ∂e_α^p has been defined and is a
chain in T e_α^p. Since ∂τ∂e_α^p = τ∂∂e_α^p = 0, τ∂e_α^p is a cycle in T e_α^p; hence,
since T e_α^p is acyclic in dimension greater than zero, τ∂e_α^p is a bounding cycle
in T e_α^p. Let τ e_α^p be chosen as a chain of T e_α^p whose boundary is τ∂e_α^p.

In particular, if K and K' are simplicial, there is a simplicial map
π: K' → K, called a projection, which maps each vertex v of K' into a vertex
of T v. Then π induces chain mappings π_p: C_p(K') → C_p(K) such that π_p e_α^p
is a chain of T e_α^p. We may then take τ_p = π_p.

It is to be shown that τ_p ρ_p: C_p(K) → C_p(K) is the identity. This is
clear for p = 0. Assume that p > 0 and that it is proved for dimensions
less than p. Then τρ e_β^p is a chain of T Sd e_β^p = ē_β^p, and ∂τρ e_β^p = τρ∂e_β^p = ∂e_β^p
by the induction hypothesis. But the only chain of ē_β^p with boundary ∂e_β^p is
the chain e_β^p. Hence τρ e_β^p = e_β^p. It follows that τρ is the identity chain
mapping.
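The relations ρ∂ = ∂ρ, τ∂ = ∂τ and τρ = identity can be seen on the smallest example, a 1-cell subdivided at an interior point. The sketch below is an illustration under assumed names and orientations and is not part of the paper: K has vertices a, b and the edge ab with boundary b - a; K' has vertices a, m, b and edges am, mb; τ is induced by the projection sending m to b. Chains are dictionaries from cell names to integer coefficients.

```python
def linear_extend(images):
    """Extend a map given on elementary chains linearly to all chains."""
    def apply(chain):
        out = {}
        for cell, coeff in chain.items():
            for target, c in images[cell].items():
                out[target] = out.get(target, 0) + coeff * c
        # Drop zero coefficients so that equal chains compare equal.
        return {cell: c for cell, c in out.items() if c != 0}
    return apply


# Boundary operators of K and of its subdivision K'.
BOUNDARY_K = linear_extend({"a": {}, "b": {}, "ab": {"b": 1, "a": -1}})
BOUNDARY_K_PRIME = linear_extend({
    "a": {}, "m": {}, "b": {},
    "am": {"m": 1, "a": -1},
    "mb": {"b": 1, "m": -1},
})

# rho: each cell of K goes to the sum of the cells of K' subdividing it.
RHO = linear_extend({"a": {"a": 1}, "b": {"b": 1}, "ab": {"am": 1, "mb": 1}})

# tau: induced by the projection a -> a, m -> b, b -> b; the edge mb
# collapses to the vertex b and therefore contributes the zero chain.
TAU = linear_extend({
    "a": {"a": 1}, "m": {"b": 1}, "b": {"b": 1},
    "am": {"ab": 1}, "mb": {},
})

K_CELLS = ["a", "b", "ab"]
K_PRIME_CELLS = ["a", "m", "b", "am", "mb"]
```

With these definitions one can compare `RHO(BOUNDARY_K(c))` with `BOUNDARY_K_PRIME(RHO(c))` for each elementary chain c of K, and likewise check that `TAU(RHO(c))` returns c.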

We now show that the chain mapping ρτ is chain homotopic to the
identity. We define a homomorphism D_{p+1}: C_p(K') → C_{p+1}(K') so that
∂Dc^p + D∂c^p = c^p - ρτc^p, and so that De_α^p is a chain of Sd T e_α^p. If e_α^0 is
an elementary 0-chain, e_α^0 - ρτe_α^0 - D∂e_α^0 = e_α^0 - ρτe_α^0 is a bounding
0-chain in Sd T e_α^0. Let D_1 e_α^0 be chosen as a 1-chain in Sd T e_α^0 whose boundary
is e_α^0 - ρτe_α^0. Suppose that, for p > 0, D has been defined for chains of
dimension less than p. Then D∂e_α^p has been defined. Let c_p = e_α^p - ρτe_α^p
- D∂e_α^p. Then c_p is a p-chain in Sd T e_α^p, and by computation one finds that
∂c_p = 0. Thus c_p is a p-cycle in Sd T e_α^p, a subcomplex which is acyclic in
dimensions greater than zero. Hence c_p is a bounding cycle in Sd T e_α^p. Let
D_{p+1} e_α^p be chosen as a (p+1)-chain in Sd T e_α^p bounded by c_p.

Thus τρ is the identity and ρτ is chain homotopic to the identity.
Therefore, ρ is a chain equivalence, and we have


(2.3) If K' is a subdivision of an affine complex K, the homology
group H_p(K') is isomorphic with H_p(K), and the cohomology group H^p(K')
is isomorphic with H^p(K). If K and K' are simplicial, the homomorphisms
H_p(K') → H_p(K) and H^p(K) → H^p(K') induced by the projection
π: K' → K are isomorphisms onto.


II. Topological complexes. 


3. Definition of topological complexes. An affine complex K is called
a topological complex if its underlying point set is a topological space, and if

(a) each φ_α: Ē_α → ē_α is a homeomorphism,

(b) for each neighborhood U of each point x of K, there is some
neighborhood V of x such that, for each point y in V, x ∈ ē(y), and the segment
[x, y] in ē(y) is contained in U.


From condition (b) it follows immediately that as a space, K is locally
connected. Also by condition (b), each point x has a neighborhood V such
that, for each y ∈ V, y ∈ St e(x), hence such that V ⊂ St e(x). If e_α is any
cell of K, and if x ∈ St e_α, then e_α ≤ e(x), and St e(x) ⊂ St e_α. Thus each
point x of St e_α has a neighborhood V ⊂ St e_α; hence St e_α is open. Thus the
star of any cell is an open set containing the cell. However, as the example
below shows, one can have open stars and local connectedness with condition
(b) not satisfied.

The complement in K of a closed cell ē_α is the union of the stars of the
cells in the complement; hence K - ē_α is open, and ē_α is a closed set in K.
In fact, ē_α is the topological closure in K of the subset e_α. By condition (a),
ē_α is compact.


Example. Let X be the subset of the cartesian plane with the following
cell decomposition. The 0-cells of K are A_0: (1, 0) and A_n: (2, 1/n) for
n = 1, 2, ···. The 1-cells of K are the segments A_nA_{n+1} and the broken lines
A_0, (-1/n, -1/n), (-1/n, 1/n), A_n. The 2-cells of K are the regions
A_0A_nA_{n+1} bounded by the 1-cells A_nA_{n+1}, A_0A_n and A_0A_{n+1}. Some suitable
choice of the maps φ_α is to be made.


It may easily be verified that the space X of the example is locally
connected, and that the star of each cell is an open set. However, condition (b)
is not satisfied at the point A_0, and hence the complex is not a topological
complex.

If K is any topological complex, an affine subcomplex is a topological 
subspace. Clearly conditions (a) and (b) hold also for the subcomplex. 
Thus a subcomplex of a topological complex is a topological complex. 

If K_1 and K_2 are topological complexes, then K_1 × K_2 is an affine
complex and is also a topological space, the topological product of K_1 and K_2.
Clearly φ_αβ: Ē_α × Ē_β → ē_αβ (= ē_α × ē_β) is a homeomorphism if φ_α: Ē_α → ē_α
and φ_β: Ē_β → ē_β are homeomorphisms. Thus condition (a) is satisfied. Any
neighborhood of a point (x_1, x_2) of K_1 × K_2 contains a neighborhood of the
form U_1 × U_2. Let neighborhoods V_1 of x_1 in K_1 and V_2 of x_2 in K_2 be
chosen as in condition (b). Then V_1 × V_2 is a neighborhood of (x_1, x_2)
such that if (y_1, y_2) ∈ V_1 × V_2, then x_1 ∈ ē(y_1), x_2 ∈ ē(y_2), and hence
(x_1, x_2) ∈ ē(y_1) × ē(y_2) = ē(y_1, y_2), and the segment [(x_1, x_2), (y_1, y_2)] in
ē(y_1, y_2) is contained in [x_1, y_1] × [x_2, y_2] ⊂ U_1 × U_2. Thus condition (b)
is satisfied, and we have


(3.1) The product of two topological complexes is a topological complex.


III. Whitehead complexes. 


4. The Whitehead topology. An arbitrary affine complex K can be
made into a topological complex by giving it the finest¹ ([2], p. 9) topology
consistent with condition (a). That is (cf. [11], p. 316), a set of K is called
open if and only if its intersection with each closed cell $\bar e_\alpha$ is the image by $\phi_\alpha$
of an open set of $E_\alpha$. If U is any open set containing a given point x of K,
let V be the set of points y of St e(x) such that the segment [x, y] in $\bar e(y)$
is contained in U. It can be seen that the intersection of V with each closed
cell $\bar e_\alpha$ is an open set of $\bar e_\alpha$; hence V is open. Clearly $x \in V$. Thus condition
(b) is satisfied. This finest topology will be called the Whitehead topology,
and the resulting topological complex will be called a Whitehead complex.
It is known ([11], p. 320; [12], p. 225) that a Whitehead complex is a
normal Hausdorff space.


¹ "Fine" in the sense of Bourbaki means "weak" in the sense of Whitehead or
"strong" as used in functional analysis.



C. H. DOWKER. 


An equivalent description of the Whitehead topology is that a set of K is
called closed if and only if its intersection with each closed cell $\bar e_\alpha$ is the image
by $\phi_\alpha$ of a closed set of $E_\alpha$.

It can be shown ([11], pp. 316-317) that if K is star-finite (in particular,
finite), the Whitehead topology is the only one which satisfies conditions (a)
and (b). Accordingly, given a star-finite affine complex K, one may speak
unambiguously of an open (or closed) set of K.


(4.1) If L is a subcomplex of a Whitehead complex K, the subspace
topology of L coincides with its Whitehead topology.²


Proof. If A is a subset of L closed in the Whitehead topology of L,
then $A \cap \bar e_\alpha$ is closed in $\bar e_\alpha$ for every $\bar e_\alpha$ in L, hence also for every $\bar e_\alpha$ in K.
Therefore A is closed in K, and hence $A = A \cap L$ is closed in the subspace
topology of L. Thus the Whitehead topology is not finer than the subspace
topology. Since the Whitehead topology of L is the finest topology consistent
with condition (a), it is also at least as fine as the subspace topology, and
the two topologies coincide.


(4.2) A transformation f of a Whitehead complex K into a topological
space Y is continuous if and only if $f | \bar e_\alpha$ is continuous for each closed cell
$\bar e_\alpha$ of K.

Proof.³ If f is continuous, $f^{-1}(V)$ is open in K for each open set V of Y;
hence $f^{-1}(V) \cap \bar e_\alpha$ is open in $\bar e_\alpha$ and $f | \bar e_\alpha$ is continuous. Conversely, if $f | \bar e_\alpha$
is continuous, $f^{-1}(V) \cap \bar e_\alpha$ is open in $\bar e_\alpha$, and hence $f^{-1}(V)$ is open in K;
therefore f is continuous.


(4.3) A transformation f of a Whitehead complex K into a topological
space Y is continuous if and only if $f | L$ is continuous for each finite
subcomplex L of K.


Proof. If f is continuous, then $f | L$ is continuous for every subspace L,
in particular for each finite subcomplex. If $f | L$ is continuous for each finite
subcomplex L, then in particular $f | \bar e_\alpha$ is continuous for each closed cell $\bar e_\alpha$,
and hence, by (4.2), f is continuous.


5. The product complex. If K is an affine complex, we denote by $K_W$
the same complex with the Whitehead topology.

(5.1) Let $K \times L$ be the product of two affine complexes. Then if
either K or L is star-finite, $(K \times L)_W = K_W \times L_W$.


² See Whitehead ([12], p. 224).
³ See Whitehead ([11], p. 317; [12], p. 224).




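Proof. By (3.1), $K_W \times L_W$ is a topological complex. Hence the
product topology is not finer than the Whitehead "finest" topology. It is
then sufficient to show that if a set G is open in $(K \times L)_W$, it is open in
$K_W \times L_W$; that is, for each point $(x_0, y_0)$ of G there exist open sets U and
V of $K_W$ and $L_W$ respectively, such that $x_0 \in U$, $y_0 \in V$ and $U \times V \subset G$.

Assume that it is L which is star-finite. For any cell $\bar e_\beta$ of L,
$\bar e(x_0) \times \bar e_\beta = (\bar e(x_0) \times \bar e_\beta)_W$ is a finite subcomplex of $(K \times L)_W$, and
$G_\beta = G \cap (\bar e(x_0) \times \bar e_\beta)_W$ is open in $\bar e(x_0) \times \bar e_\beta$. Hence $\{y \mid (x_0, y) \in G_\beta\}$ is open
in $\bar e_\beta$. Let $H = \{y \mid (x_0, y) \in G\}$; then $H \cap \bar e_\beta = \{y \mid y \in \bar e_\beta, (x_0, y) \in G\}$
$= \{y \mid (x_0, y) \in G_\beta\}$, which is open in $\bar e_\beta$. Hence H is open in $L_W$.

Clearly $y_0 \in H$. Since $L_W$ is a normal space, we can choose an open set
V of $L_W$ so that $y_0 \in V$, $\bar V \subset H$, and $\bar V \subset \mathrm{St}\, e(y_0)$. Let $M = \overline{\mathrm{St}\, e(y_0)}$; since
L is star-finite, M is a finite subcomplex of L. Then $y_0 \in V \subset \bar V \subset H \cap M$. Let
the subset U of K be defined by $U = \{x \mid x \times \bar V \subset G\}$. Since $\bar V \subset H$, $x_0 \in U$.
Hence $(x_0, y_0) \in U \times V \subset G$. Thus there remains to be shown only that U
is open in $K_W$.

If $e_\alpha$ is a cell of K, $\bar e_\alpha \times M = (\bar e_\alpha \times M)_W$ is a finite subcomplex of
$(K \times L)_W$. Hence $G_\alpha = G \cap (\bar e_\alpha \times M)_W$ is an open set of $\bar e_\alpha \times M$. Then
$U \cap \bar e_\alpha = \{x \mid x \in \bar e_\alpha, x \times \bar V \subset G\} = \{x \mid x \times \bar V \subset G_\alpha\}$, which is open in $\bar e_\alpha$, since
$G_\alpha$ is open and $\bar V$ is compact. Hence U is open in the Whitehead complex $K_W$.
This completes the proof of (5.1).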

The condition of star-finiteness of one factor cannot be dropped, as the
following example shows.


Example. Let K consist of a collection of closed 1-cells $A_i$ of the power
of the continuum, with a common vertex $u_0$. Let L be a countably infinite
collection of closed 1-cells $B_j$, j = 1, 2, ···, with a common vertex $v_0$.
Then $(K \times L)_W \ne K_W \times L_W$.

Proof. Let the closed 1-cells $A_i$ have parameters $x_i$, $0 \le x_i \le 1$, so that,
at the point $u_0$, $x_i = 0$ for all i. Let the closed 1-cells $B_j$ have parameters $y_j$,
$0 \le y_j \le 1$, and at $v_0$, let $y_j = 0$ for all j. Let the indices i be sequences of
positive integers; $i = \{i_1, i_2, \cdots\}$. For each pair (i, j) of indices, let $p_{ij}$ be the
point $(1/i_j, 1/i_j)$ of $A_i \times B_j \subset K \times L$, and let P be the set of all such points
$p_{ij}$. Then for each $A_i \times B_j$, $P \cap (A_i \times B_j)$ consists of one point $p_{ij}$, hence is
closed in $A_i \times B_j$. Thus P is closed in $(K \times L)_W$.


A neighborhood U of $u_0$ in $K_W$ is given by $x_i < a_i$, where the $a_i$ are
positive numbers. A neighborhood V of $v_0$ in $L_W$ is given by $y_j < b_j$, $b_j > 0$.

⁴ See Whitehead ([12], p. 227).




Let $U \times V$ be a product neighborhood of $(u_0, v_0)$ in $K_W \times L_W$. Let
$i = \{i_1, i_2, \cdots\}$ be chosen so that for each j, $i_j > j$ and $i_j > (b_j)^{-1}$. Let j be
chosen so that $j > (a_i)^{-1}$. Then $(i_j)^{-1} < (j)^{-1} < a_i$, and $(i_j)^{-1} < b_j$. Hence
$p_{ij} \in U \times V$. Therefore every neighborhood of $(u_0, v_0)$ in $K_W \times L_W$ contains
a point of P. Hence, since $(u_0, v_0) \notin P$, P is not closed in the product topology.
Thus the product topology is not so fine as the Whitehead topology.⁵
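
The choice of i and j in this proof can be traced numerically. The following is only a minimal sketch under stated assumptions: the function name `dowker_witness` and the callables `a` and `b` are hypothetical stand-ins for the families $a_i$ and $b_j$, an index sequence i is modelled as a callable j -> i_j, and exact rational arithmetic is used to avoid rounding.

```python
from fractions import Fraction
import math


def dowker_witness(a, b):
    """Pick the index sequence i and the integer j as in the proof.

    a: hypothetical callable taking an index sequence (a callable j -> i_j)
       and returning the positive Fraction a_i.
    b: hypothetical callable taking j >= 1 and returning the positive
       Fraction b_j.
    Returns (i, j, p) where p = 1/i_j is the common coordinate of p_ij.
    """
    def i(j):
        # i_j > j and i_j > 1/b_j, as required in the proof
        return max(j, math.floor(1 / b(j))) + 1

    # j > 1/a_i for the sequence i just chosen
    j = math.floor(1 / a(i)) + 1
    return i, j, Fraction(1, i(j))


# Illustrative (assumed) neighborhood data, not taken from the paper.
def b(j):
    return Fraction(1, j * j)


def a(i):
    return Fraction(1, i(3) + 5)


i_seq, j, p = dowker_witness(a, b)
print(j, i_seq(j), p)
```

Whatever positive values `a` and `b` return, the point $p_{ij}$ produced this way has both coordinates below $a_i$ and $b_j$, which is the step of the proof showing that P meets every product neighborhood.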


(5.2) Let K be a Whitehead complex, let I be the closed interval
$0 \le t \le 1$, and let Y be a space. Let h be a function from $K \times I$ to Y such
that, for each closed cell $\bar e_\alpha$ of K, $h | \bar e_\alpha \times I$ is continuous. Then $h: K \times I \to Y$
is continuous.⁶


Proof. We regard I as a complex consisting of one 1-cell and two 0-cells.
Then by (5.1), since I is a finite (and hence star-finite) complex, $K \times I$ is a
Whitehead complex. But for each closed cell $\bar e_\beta$ of $K \times I$, there is a closed
cell $\bar e_\alpha$ of K such that $\bar e_\beta \subset \bar e_\alpha \times I$. Therefore, since $h | \bar e_\alpha \times I$ is continuous,
$h | \bar e_\beta$ is continuous. It follows from (4.2) that h is continuous.


6. Subdivisions of Whitehead complexes. One can show that any
subdivision of a Whitehead complex is a Whitehead complex. More exactly, we
have


(6.1) If $Sd: K \to K'$ is a subdivision of an affine complex K, and if
both K and K' are given the Whitehead topology, then Sd is a homeomorphism.


Proof. $Sd^{-1}$ maps each closed cell of K' linearly, and hence continuously,
into a closed cell of K. Hence $Sd^{-1}$ is continuous. Sd maps each finite
subcomplex L of K onto a finite subcomplex L' of K', and $Sd | L$ is piecewise
linear, hence continuous. Thus Sd is continuous, and therefore a
homeomorphism.


(6.2) Isomorphic Whitehead complexes are homeomorphic.


Proof. Let K and L be isomorphic Whitehead complexes. If K and L
are simplicial, the natural barycentric map of K on L is linear and hence
continuous on each closed cell, and its inverse has the same property. Hence
the natural barycentric map is a homeomorphism. If K and L are not
simplicial, then by (2.1) they have isomorphic, and hence homeomorphic,
simplicial subdivisions K' and L'. But by (6.1) K is homeomorphic to K',
and L to L'. Hence K and L are homeomorphic.


⁵ This answers a question of Whitehead, see [12, page 227, footnote 21].
⁶ See Whitehead ([12], p. 228).




7. Coverings. A covering of a space X is a collection $\{U_\alpha\}$ of open
sets whose union is X. A covering $\{U_\alpha\}$ is called locally finite if every point
of X has a neighborhood which meets only a finite number of sets $U_\alpha$ of the
covering. A covering $\{V_\beta\}$ is called a refinement of $\{U_\alpha\}$, if each $V_\beta$ is
contained in some $U_\alpha$. If K is a topological complex, the star of each vertex
$v_\alpha$ of K is an open set, and every point of K is in the star of some vertex;
hence $\{\mathrm{St}\, v_\alpha\}$ is a covering of K.

(7.1) If K is a Whitehead complex and if $\{U_\alpha\}$ is a covering of K,
there exists a simplicial subdivision $Sd: K \to K'$ such that the covering of K'
by the stars of its vertices is a refinement of the covering $\{Sd\, U_\alpha\}$.


For the proof see Whitehead ([11], Theorem 35). 

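Let X be a topological space, let $\mathfrak{U}$ be a covering of X, and let N
(= N($\mathfrak{U}$)) be the nerve of $\mathfrak{U}$ topologized as a Whitehead complex. Then N
is a simplicial complex with a vertex $u_\alpha$ corresponding to each non-empty set
$U_\alpha$ of $\mathfrak{U}$, and a simplex $u_{\alpha_0} \cdots u_{\alpha_p}$ corresponding to each non-empty
intersection $U_{\alpha_0} \cap \cdots \cap U_{\alpha_p}$. A mapping $\phi: X \to N$ is called canonical with
respect to $\mathfrak{U}$ if for each $U_\alpha \in \mathfrak{U}$, $\phi^{-1}(\mathrm{St}\, u_\alpha) \subset U_\alpha$.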
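
For a finite covering of a finite set the nerve can be listed directly from this definition. The sketch below is illustrative only: the function name `nerve` and the sample covering are assumptions, and the brute-force enumeration over all subfamilies is practical only for very small coverings.

```python
from itertools import combinations


def nerve(cover):
    """Return the simplexes of the nerve of a finite covering.

    cover: dict mapping a (hypothetical) name of each set to a Python set
    of points. A simplex is a frozenset of names whose sets have a
    non-empty common intersection; empty sets contribute no vertex.
    """
    names = [name for name, members in cover.items() if members]
    simplexes = set()
    for size in range(1, len(names) + 1):
        for combo in combinations(names, size):
            common = set.intersection(*(set(cover[name]) for name in combo))
            if common:
                simplexes.add(frozenset(combo))
    return simplexes


# Three sets meeting pairwise but with empty triple intersection:
# the nerve is the boundary of a triangle (3 vertices, 3 edges).
cover = {"U0": {1, 2}, "U1": {2, 3}, "U2": {3, 1}}
N = nerve(cover)
print(sorted(sorted(s) for s in N))
```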


(7.2) If $\mathfrak{U}$ is a locally finite covering of a normal space X, there exists
a canonical mapping of X into the Whitehead nerve N of $\mathfrak{U}$.


Proof.⁷ Let M be the nerve of $\mathfrak{U}$ metrized with the "natural" metric
([9], (4.12)). Then there exists ([4], 3(a); [3], Theorem 1.1) a canonical
map $\theta$ of X into M. Now M and N are the same affine complex with two
different topologies; let $\chi: M \to N$ be the identity map. Then $\chi$ is linear and
hence continuous on each closed cell of M. Hence $\chi$ is continuous on each
finite subcomplex of M. Hence by [3], Lemma 1.2, $\chi\theta: X \to N$ is continuous.
Let $\phi = \chi\theta$; clearly $\phi$ is a canonical map of X into N.


8. Homology and cohomology groups. Let K be a Whitehead complex,
and let | K | be its underlying space. If K' is a subdivision of K, we identify
the spaces | K | and | K' | by means of the homeomorphism $Sd: | K | \to | K' |$.
By the Čech cohomology groups of K we shall mean the Čech cohomology
groups⁸ of the space | K |.

(8.1) The Čech cohomology groups of a Whitehead complex K are
isomorphic with the corresponding combinatorial cohomology groups of K.


⁷ It may be seen that the proof of [3], Theorem 1.1 or the proof outlined for [4],
proposition (a) does not depend on the special topology of the nerve; thus either of these
gives a direct proof of (7.2).

⁸ For definition and properties of Čech cohomology groups see [5].




Proof. Let $K'_0$ be a fixed simplicial subdivision of K. For each
simplicial subdivision $K'_\alpha$ of $K'_0$, the nerve of the covering $\mathfrak{U}_\alpha$ of | K | by the
stars of the vertices of $K'_\alpha$ is a complex which can be identified with $K'_\alpha$ itself.
By (7.1) these coverings $\mathfrak{U}_\alpha$ of | K | by stars of vertices of simplicial
subdivisions of $K'_0$ form a cofinal family of coverings. By (2.3), the projection
map $\pi_{\alpha 0}$ of the nerve $K'_\alpha$ of $\mathfrak{U}_\alpha$ into the nerve $K'_0$ of $\mathfrak{U}_0$ induces an
isomorphism $\pi^*_{\alpha 0}$ of $H^p(K'_0)$ onto $H^p(K'_\alpha)$. If $K'_\alpha$ and $K'_\beta$ are two simplicial
subdivisions of $K'_0$ such that $\mathfrak{U}_\beta$ is a refinement of $\mathfrak{U}_\alpha$, then ([5], p. 282)
$\pi^*_{\beta 0} = \pi^*_{\beta\alpha}\pi^*_{\alpha 0}: H^p(K'_0) \to H^p(K'_\beta)$, and since $\pi^*_{\alpha 0}$ and $\pi^*_{\beta 0}$ are
isomorphisms onto, $\pi^*_{\beta\alpha} = \pi^*_{\beta 0}(\pi^*_{\alpha 0})^{-1}: H^p(K'_\alpha) \to H^p(K'_\beta)$ is also an isomorphism
onto.

Since the coverings $\mathfrak{U}_\alpha$ form a cofinal family of coverings of | K |, the
Čech cohomology group $H^p(| K |, G)$ of | K | based on a discrete coefficient group
G is the limit group of a direct spectrum, the groups of which are the
cohomology groups $H^p(K'_\alpha, G)$ of the simplicial subdivisions of K, and the
homomorphisms of which are the homomorphisms $\pi^*_{\beta\alpha}: H^p(K'_\alpha, G) \to H^p(K'_\beta, G)$.
Since by (2.3) each $H^p(K'_\alpha, G)$ is isomorphic with $H^p(K, G)$, and since
each $\pi^*_{\beta\alpha}$ is an isomorphism onto, it follows that the limit group $H^p(| K |, G)$
is also isomorphic with $H^p(K, G)$. Thus the Čech cohomology group
$H^p(| K |, G)$ of K is isomorphic with the combinatorial cohomology group
$H^p(K, G)$.

Another consequence of (7.1) is the following result. 


(8.2) The singular homology (cohomology) groups of a Whitehead 
complex K are isomorphic with the corresponding combinatorial homology 
(cohomology) groups of K. 


For the proof in case K is simplicial see ([7], pp. 399-400). If K is
not simplicial, we replace it by a simplicial subdivision $K'_0$, where, by (6.1),
| K | and | $K'_0$ | are homeomorphic and, by (2.3), the combinatorial homology
and cohomology groups of K are isomorphic with those of $K'_0$.


IV. Metric complexes. 


9. Definition and properties of metric complexes. A metric complex
is a topological complex whose underlying space is a metric space (a topological
complex whose underlying space is metrizable will be called a metrizable
complex). Replacing condition (b) of section 3 by the equivalent condition
(b'), we can say that a metric complex is an affine complex K whose
underlying set is a metric space subject to the conditions


(a) Each $\phi_\alpha: E_\alpha \to \bar e_\alpha$ is a homeomorphism,


(b') For each point x of K and each positive number $\epsilon$, there exists a
positive number $\delta$ such that⁹ if $\rho(x, y) < \delta$, then $x \in \bar e(y)$, and for each point
z of the segment [x, y] in $\bar e(y)$, $\rho(x, z) < \epsilon$.


Every subcomplex of a metric complex is a metric complex, for every
subspace of a metric space is a metric space, and every subcomplex of a
topological complex is a topological complex. The cartesian product of two
metric complexes is a metric complex, for the product space of two metric
spaces is a metric space, and the product of two topological complexes is a
topological complex.

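Let K be a metric complex. We define $\eta(x, \epsilon)$ as follows: Let $2\eta(x, \epsilon)$
be the least upper bound of the $\delta$'s of condition (b') if this least upper bound
exists and is less than $\epsilon$; if the $\delta$'s are unbounded, or if their least upper
bound is not less than $\epsilon$, let $2\eta(x, \epsilon) = \epsilon$. Thus $\eta(x, \epsilon)$ is defined for each
$x \in K$ and each $\epsilon > 0$, and $\eta(x, \epsilon) \le \epsilon/2$. Clearly if $\epsilon_1 < \epsilon_2$, then
$\eta(x, \epsilon_1) \le \eta(x, \epsilon_2)$. If $y \in K$, and $\rho(x, y) < 2\eta(x, \epsilon)$, then the segment [x, y]
exists in $\bar e(y)$, and for each $z \in [x, y]$, $\rho(x, z) < \epsilon$.

We define $\eta_r(x, \epsilon)$ inductively as follows, for r = 0, 1, 2, ···. Let
$\eta_0(x, \epsilon) = \epsilon$, and for $r \ge 1$, let $\eta_r(x, \epsilon) = \eta(x, \eta_{r-1}(x, \epsilon))$. One sees
immediately that $\eta_1(x, \epsilon) = \eta(x, \epsilon)$. It also follows
from the definition that for r > 0, $\eta_r(x, \epsilon) \le \tfrac{1}{2}\eta_{r-1}(x, \epsilon)$, and for $r \ge 2$,
$\eta_r(x, \epsilon) \le \tfrac{1}{2}\eta(x, \epsilon)$.

(9.1) For any metric complex, if $r \ge 1$, then $\eta_r(x, \epsilon) = \eta_{r-1}(x, \eta(x, \epsilon))$.


Proof. This is clear for r = 1. We proceed by induction. Let $r \ge 2$,
and assume $\eta_{r-1}(x, \epsilon) = \eta_{r-2}(x, \eta(x, \epsilon))$. Then $\eta_r(x, \epsilon) = \eta(x, \eta_{r-1}(x, \epsilon))$
$= \eta(x, \eta_{r-2}(x, \eta(x, \epsilon))) = \eta_{r-1}(x, \eta(x, \epsilon))$.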
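
The recursion for $\eta_r$ and the identity (9.1) can be checked mechanically for any concrete choice of $\eta$. The sketch below is only an illustration: the function `eta` is a hypothetical stand-in (any positive function with $\eta(x, \epsilon) \le \epsilon/2$ would do) and is not derived from an actual metric complex.

```python
from fractions import Fraction


def eta(x, eps):
    # Hypothetical stand-in for eta(x, eps): positive and at most eps/2.
    return min(eps / 2, Fraction(1, 1 + abs(x)))


def eta_r(x, eps, r):
    """eta_0(x, eps) = eps and eta_r(x, eps) = eta(x, eta_{r-1}(x, eps))."""
    value = eps
    for _ in range(r):
        value = eta(x, value)
    return value


x, eps = 2, Fraction(3)
for r in range(1, 6):
    # (9.1): eta_r(x, eps) = eta_{r-1}(x, eta(x, eps))
    assert eta_r(x, eps, r) == eta_r(x, eta(x, eps), r - 1)
    # eta_r(x, eps) <= (1/2) eta_{r-1}(x, eps)
    assert eta_r(x, eps, r) <= eta_r(x, eps, r - 1) / 2
```

The identity holds for every choice of `eta`, since both sides apply `eta(x, ·)` to $\epsilon$ exactly r times; the halving bound uses only $\eta(x, \epsilon) \le \epsilon/2$.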

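We write $S(x, \epsilon)$, or $S_0(x, \epsilon)$, for the set of points y of K such that
$\rho(x, y) < \epsilon$, and $S_r(x, \epsilon)$ for the neighborhood $S(x, \eta_r(x, \epsilon))$. We write {p}
for the set consisting of one element p.

(9.2) Let A be a set of r points ($r \ge 1$) of a closed cell $\bar e_\alpha$ of a metric
complex K, let $\epsilon$ be a positive number, and let $\bar x$ be a point of A such that
$\eta(\bar x, \epsilon) \ge \eta(x, \epsilon)$ for all $x \in A$. Then if there is some point y of $\bar e_\alpha$ such that
$y \in S_r(x, \epsilon)$ for all $x \in A$, the convex hull of $\{y\} \cup A$ in $\bar e_\alpha$ is contained in
$S(\bar x, \epsilon)$.

Proof. First let r = 1. Then $A = \{\bar x\}$, and the convex hull of $\{y\} \cup A$
in $\bar e_\alpha$ is the segment $[\bar x, y]$ in $\bar e_\alpha$. Since $y \in S_1(\bar x, \epsilon) = S(\bar x, \eta(\bar x, \epsilon))$, $\rho(\bar x, y)$
$< \eta(\bar x, \epsilon)$, and hence if $z \in [\bar x, y]$, $\rho(\bar x, z) < \epsilon$. Thus $[\bar x, y] \subset S(\bar x, \epsilon)$.


⁹ We write $\rho(x, y)$ for the distance between the points x and y.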




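We proceed by induction. Let $r \ge 2$, and assume the result has been
proved for r − 1. Let $B = A - \{\bar x\}$, let $B_1 = \{y\} \cup B$, and let $A_1 = \{y\} \cup A$.
Then for any point z of the convex hull $A_1^*$ of $A_1$ in $\bar e_\alpha$, $z \in [\bar x, y_1]$, where $y_1$
is a point in the convex hull $B_1^*$ of $B_1$ in $\bar e_\alpha$. By the induction hypothesis,
since $y \in S_{r-1}(x, \eta(x, \epsilon)) \subset S_{r-1}(x, \eta(\bar x, \epsilon))$ for each $x \in B$, $B_1^* \subset S(\tilde x, \eta(\bar x, \epsilon))$
for some $\tilde x \in B$. Hence $\rho(\tilde x, y_1) < \eta(\bar x, \epsilon)$, and $\rho(\bar x, y_1) \le \rho(\bar x, y) + \rho(y, \tilde x)$
$+ \rho(\tilde x, y_1) < \eta_r(\bar x, \epsilon) + \eta_r(\tilde x, \epsilon) + \eta(\bar x, \epsilon) \le \tfrac{1}{2}\eta(\bar x, \epsilon) + \tfrac{1}{2}\eta(\tilde x, \epsilon) + \eta(\bar x, \epsilon)$
$\le 2\eta(\bar x, \epsilon)$. Therefore $\rho(\bar x, z) < \epsilon$ and $A_1^* \subset S(\bar x, \epsilon)$.


(9.3) For $r \ge 1$ let A be a set of r points, not necessarily distinct, of a
closed cell $\bar e_\alpha$ of a metric complex K, and let $\epsilon$ be a positive number. Then if
C is a non-empty convex set of $\bar e_\alpha$ contained in $\bigcap_{x \in A} S_r(x, \epsilon)$, the convex hull
of $C \cup A$ in $\bar e_\alpha$ has diameter less than $2\epsilon$.


Proof. Note that $(C \cup A)^*$ is the union of the sets $(\{y\} \cup A)^*$ for all
$y \in C$. By (9.2), each $(\{y\} \cup A)^* \subset S(\bar x, \epsilon)$, where $\bar x$ is a point of A for
which $\eta(\bar x, \epsilon)$ is maximal. Hence $(C \cup A)^* \subset S(\bar x, \epsilon)$, and the diameter of
$(C \cup A)^*$ is less than $2\epsilon$.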


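10. The conditions to be satisfied. Let K be a metric complex. For
each positive integer n we choose a collection $\mathfrak{U}^n$ of open sets of K, and with
each open set $U_\lambda$ of $\mathfrak{U}^n$ we associate a point $x_\lambda$ of K. (The indices $\lambda$ of the
sets of $\mathfrak{U}^n$ are assumed to be elements of some sufficiently large index set.)
With each cell $e_\alpha$ of K and each positive integer n, we associate a subcollection
$\mathfrak{U}_\alpha^n$ of $\mathfrak{U}^n$ and a positive real number $\rho_\alpha^n$. The choice of $\mathfrak{U}^n$, $x_\lambda$, $\mathfrak{U}_\alpha^n$ and $\rho_\alpha^n$
will be subject to the following conditions:


1) $\mathfrak{U}^n = \bigcup_\alpha \mathfrak{U}_\alpha^n$.
2) $\mathfrak{U}_\alpha^n$ is a finite collection of open sets whose union contains $\bar e_\alpha$.
3) If $U_\lambda \in \mathfrak{U}_\alpha^n$, then $x_\lambda \in \bar e_\alpha$, and, for any $y \in U_\lambda$ and $z \in K - \mathrm{St}\, e(x_\lambda)$,
$\rho(x_\lambda, y) < \tfrac{1}{2}\rho(x_\lambda, z)$.
4) If $e_\beta < e_\alpha$, $\mathfrak{U}_\beta^n \subset \mathfrak{U}_\alpha^n$.
5) If, for $e_\beta \le e_\alpha$, $U_\lambda \in \mathfrak{U}_\alpha^n$, and $x_\lambda \in U_\mu \in \mathfrak{U}_\beta^n$, then $U_\lambda \in \mathfrak{U}_\beta^n$.
6) Let $e_\alpha \le e_\gamma$, and let C be a convex set in $\bar e_\gamma$. Let $U_{\lambda_1}, \cdots, U_{\lambda_r}$
be sets of $\mathfrak{U}_\alpha^n$, and let $U_{\mu_1}, \cdots, U_{\mu_s}$ be sets of $\mathfrak{U}_\alpha^{n+1}$ such that the intersection
$C \cap U_{\lambda_1} \cap \cdots \cap U_{\lambda_r} \cap U_{\mu_1} \cap \cdots \cap U_{\mu_s}$
is not empty. Then if C has diameter less than $2\rho_\alpha^n$, the convex hull of the
union $C \cup \{x_{\lambda_1}, \cdots, x_{\lambda_r}, x_{\mu_1}, \cdots, x_{\mu_s}\}$ in $\bar e_\gamma$ has diameter less than 1/n.


Note that condition 3) implies condition 3').


3') If $U_\lambda \in \mathfrak{U}_\alpha^n$, then $e(x_\lambda) \le e_\alpha$, and $U_\lambda \subset \mathrm{St}\, e(x_\lambda)$.


Also, if condition 3) holds for each of two cells $e_\alpha$ and $e_\beta$, then condition
3'') holds.


3'') If $U_\lambda \in \mathfrak{U}_\alpha^n$ and $U_\mu \in \mathfrak{U}_\beta^n$, and if $U_\lambda \cap U_\mu \ne 0$, then either
$e(x_\lambda) \le e(x_\mu)$ or $e(x_\mu) \le e(x_\lambda)$.


For if neither of $e(x_\lambda)$ and $e(x_\mu)$ is a face of the other, then $x_\lambda \in e(x_\lambda) \subset K$
$- \mathrm{St}\, e(x_\mu)$, and $x_\mu \in e(x_\mu) \subset K - \mathrm{St}\, e(x_\lambda)$. Let $x \in U_\lambda \cap U_\mu$. By 3) applied
to $e_\alpha$, $\rho(x_\lambda, x) < \tfrac{1}{2}\rho(x_\lambda, x_\mu)$, and by 3) applied to $e_\beta$, $\rho(x_\mu, x) < \tfrac{1}{2}\rho(x_\mu, x_\lambda)$.
Therefore $\rho(x_\lambda, x_\mu) > \rho(x_\lambda, x) + \rho(x_\mu, x)$, which is absurd.

Also, if conditions 2) and 5) hold for $e_\alpha$ and all of its faces, the following
condition 5') also holds.


5') If, for $e_\beta \le e_\alpha$, $U_\lambda \in \mathfrak{U}_\alpha^n$ and $x_\lambda \in \bar e_\beta$, then $U_\lambda \in \mathfrak{U}_\beta^n$.


For by 2), $\mathfrak{U}_\beta^n$ covers $\bar e_\beta$, and hence for some $U_\mu \in \mathfrak{U}_\beta^n$, $x_\lambda \in U_\mu$. Thus by 5),
$U_\lambda \in \mathfrak{U}_\beta^n$.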


11. The construction. We shall first construct the collections $\mathfrak{U}_\alpha^n$ of
open sets, and afterwards we shall use condition 1) to define $\mathfrak{U}^n$. The
construction of $\mathfrak{U}_\alpha^n$ will be by induction on the dimension of the cell $e_\alpha$.

First let $e_\alpha$ be a cell of dimension zero; that is, $e_\alpha$ consists of a single
point v. If v is the only point of K, let $\mathfrak{U}_\alpha^n$ consist of the one set $U_\lambda = \{v\}$,
let $x_\lambda$ be the point v, and let $\rho_\alpha^n = 1$. If K has other points, and hence
other vertices, let d(v) be the distance from v to the complement of the open
set $\mathrm{St}\, e_\alpha$, and let $\mathfrak{U}_\alpha^n$ consist of a single open set $U_\lambda$ which is the spherical
neighborhood of v with radius the smaller of $\tfrac{1}{2}d(v)$ and $\eta(v, 1/2n)$. Let $x_\lambda$
be the point v, and let $2\rho_\alpha^n = \eta(v, 1/2n)$. In either case the proof that
conditions 2) to 6) are satisfied is easy, and is left to the reader.

Now suppose that $e_\alpha$ has dimension k > 0, and suppose that $\mathfrak{U}_\beta^n$, $x_\lambda$ and
$\rho_\beta^n$ satisfying conditions 2) to 6) have been constructed for all cells $e_\beta$ of
dimension less than k (in particular for all proper faces $e_\beta$ of $e_\alpha$), and for all n.

For each n, let $A_\alpha^n$ be the set of points of $\bar e_\alpha$ which, for no proper face
$e_\beta$ of $e_\alpha$, are contained in a set of the covering $\mathfrak{U}_\beta^n$. Then $A_\alpha^n$ is a closed
and hence compact subset of $\bar e_\alpha$, and $A_\alpha^n \subset e_\alpha$. Let $\theta_\alpha^n$ be the least value of
$\rho_\beta^m$ for all $e_\beta < e_\alpha$ and all $m \le n$. Since $e_\alpha$ has only a finite number of faces,
and since a finite number of positive integers precede n + 1, the number $\theta_\alpha^n$
exists and is positive. For each point x of $A_\alpha^n$, let d(x) be the distance from
x to the complement of $\mathrm{St}\, e_\alpha$, and let N(x) be the spherical neighborhood of
x with radius r(x) equal to the smaller of $\tfrac{1}{2}d(x)$ and $(1/3)\eta_{2k+2}(x, \theta_\alpha^n)$. Since
$A_\alpha^n$ is compact and of dimension $\le k$, the covering of $A_\alpha^n$ by the neighborhoods
N(x) has as a refinement a finite covering $\mathfrak{B}_\alpha^n$ by open sets of K, such that
no point of K is contained in more than k + 1 sets of $\mathfrak{B}_\alpha^n$. For each set $U_\lambda$
of $\mathfrak{B}_\alpha^n$, let $x_\lambda$ be any one of the points of $A_\alpha^n$ for which $N(x_\lambda) \supset U_\lambda$. Let $\kappa_\alpha^n$
be the least of the finite number of distances $r(x_\lambda)$, more explicitly $r^n(x_\lambda)$,
corresponding to the finite number of $U_\lambda$ in $\mathfrak{B}_\alpha^n$; and let $\rho_\alpha^n$ be the smaller
of the two positive numbers $\kappa_\alpha^n$ and $\kappa_\alpha^{n+1}$. Let $\mathfrak{U}_\alpha^n$ consist of the sets of $\mathfrak{B}_\alpha^n$
together with all sets of $\mathfrak{U}_\beta^n$ for all proper faces $e_\beta$ of $e_\alpha$.


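12. Verification of the conditions. We now verify that the $\mathfrak{U}_\alpha^n$, $x_\lambda$
and $\rho_\alpha^n$ so defined satisfy conditions 2) to 6).

2) The collection $\mathfrak{U}_\alpha^n$ is the union of the finite collection $\mathfrak{B}_\alpha^n$ and the
finite number of finite collections $\mathfrak{U}_\beta^n$ for $e_\beta < e_\alpha$. Hence $\mathfrak{U}_\alpha^n$ is finite. For
any $x \in \bar e_\alpha$, either $x \in U_\lambda \in \mathfrak{U}_\beta^n$ for some $e_\beta < e_\alpha$, or $x \in A_\alpha^n$ and is contained
in a set of the covering $\mathfrak{B}_\alpha^n$. Hence in either case, x is contained in a set
of $\mathfrak{U}_\alpha^n$. Thus $\bar e_\alpha$ is contained in the union of the collection $\mathfrak{U}_\alpha^n$ of open sets,
and condition 2) is satisfied.

3) If $U_\lambda \in \mathfrak{B}_\alpha^n$, then $x_\lambda \in A_\alpha^n \subset e_\alpha \subset \bar e_\alpha$. Then, if $y \in U_\lambda \subset N(x_\lambda)$,
$\rho(x_\lambda, y) < r(x_\lambda) \le \tfrac{1}{2}d(x_\lambda)$, and if $z \in K - \mathrm{St}\, e(x_\lambda) = K - \mathrm{St}\, e_\alpha$, $\rho(x_\lambda, z)$
$\ge d(x_\lambda)$; hence $\rho(x_\lambda, y) < \tfrac{1}{2}\rho(x_\lambda, z)$. If $U_\lambda \in \mathfrak{U}_\alpha^n$ but $U_\lambda \notin \mathfrak{B}_\alpha^n$, then for some
$e_\beta < e_\alpha$, $U_\lambda \in \mathfrak{U}_\beta^n$. Applying condition 3) to the lower dimensional cell $e_\beta$,
$x_\lambda \in \bar e_\beta \subset \bar e_\alpha$, and for $y \in U_\lambda$ and $z \in K - \mathrm{St}\, e(x_\lambda)$, $\rho(x_\lambda, y) < \tfrac{1}{2}\rho(x_\lambda, z)$. Thus
condition 3) is satisfied.

4) By the definition of $\mathfrak{U}_\alpha^n$, condition 4) is satisfied.

5) Condition 5) is trivial for $e_\beta = e_\alpha$. Assume $e_\beta < e_\alpha$, $U_\lambda \in \mathfrak{U}_\alpha^n$, and
$x_\lambda \in U_\mu \in \mathfrak{U}_\beta^n$. Then $x_\lambda \notin A_\alpha^n$, and hence $U_\lambda \notin \mathfrak{B}_\alpha^n$. Thus for some $e_\gamma < e_\alpha$,
$U_\lambda \in \mathfrak{U}_\gamma^n$. Let $e_\delta = e(x_\mu)$. Then, by condition 3') applied to $e_\beta$, since
$U_\mu \in \mathfrak{U}_\beta^n$, we have $e_\delta \le e_\beta$, and $U_\mu \subset \mathrm{St}\, e_\delta$. By condition 5') applied to $e_\delta$
and $e_\beta$, since $U_\mu \in \mathfrak{U}_\beta^n$, and $x_\mu \in \bar e_\delta$, we have $U_\mu \in \mathfrak{U}_\delta^n$. By 3') applied to $e_\gamma$,
since $U_\lambda \in \mathfrak{U}_\gamma^n$, we have $e(x_\lambda) \le e_\gamma$. But $x_\lambda \in U_\mu \subset \mathrm{St}\, e_\delta$; hence $e_\delta \le e(x_\lambda)$,
and $e_\delta \le e_\gamma$. By 5) applied to $e_\delta$ and $e_\gamma$, since $U_\lambda \in \mathfrak{U}_\gamma^n$ and $x_\lambda \in U_\mu \in \mathfrak{U}_\delta^n$,
we have $U_\lambda \in \mathfrak{U}_\delta^n$. Since $e_\delta \le e_\beta$, it follows from condition 4) that $\mathfrak{U}_\delta^n \subset \mathfrak{U}_\beta^n$.
Hence $U_\lambda \in \mathfrak{U}_\beta^n$, and condition 5) is satisfied.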

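6) Let $e_\alpha \le e_\gamma$, and let C be a convex set in $\bar e_\gamma$ with diameter $< 2\rho_\alpha^n$.
Let $y \in C \cap U_{\lambda_1} \cap \cdots \cap U_{\lambda_r} \cap U_{\mu_1} \cap \cdots \cap U_{\mu_s}$, where each $U_{\lambda_i}$ is in $\mathfrak{U}_\alpha^n$,
and each $U_{\mu_j}$ is in $\mathfrak{U}_\alpha^{n+1}$. Let $u^n$ be the set $(x_{\lambda_1}, \cdots, x_{\lambda_r})$, and let $u^{n+1}$ be the
set $(x_{\mu_1}, \cdots, x_{\mu_s})$.

Each point of K, and in particular y, is contained in at most k + 1 sets
of $\mathfrak{B}_\alpha^n$. Hence at most k + 1 of the sets $U_{\lambda_1}, \cdots, U_{\lambda_r}$ are in $\mathfrak{B}_\alpha^n$, and at
most k + 1 of the points of $u^n$ are in $e_\alpha$. Similarly, at most k + 1 of the
points of $u^{n+1}$ are in $e_\alpha$. Let $v^n = u^n \cap e_\alpha$, and $v^{n+1} = u^{n+1} \cap e_\alpha$; then
$v^n \cup v^{n+1}$ has at most 2k + 2 points.

If $x_\lambda \in v^n$, then $y \in U_\lambda \subset N(x_\lambda)$, and hence $\rho(x_\lambda, y) < r^n(x_\lambda)$. Also
diam $C < 2\rho_\alpha^n \le 2\kappa_\alpha^n \le 2r^n(x_\lambda)$. Hence, if $z \in C$,

$\rho(x_\lambda, z) \le \rho(x_\lambda, y) + \rho(y, z) < 3r^n(x_\lambda) \le \eta_{2k+2}(x_\lambda, \theta_\alpha^n)$.

If $x_\mu \in v^{n+1}$, then $y \in U_\mu \subset N(x_\mu)$, and hence $\rho(x_\mu, y) < r^{n+1}(x_\mu)$. Also
diam $C < 2\rho_\alpha^n \le 2\kappa_\alpha^{n+1} \le 2r^{n+1}(x_\mu)$. Hence if $z \in C$,

$\rho(x_\mu, z) < 3r^{n+1}(x_\mu) \le \eta_{2k+2}(x_\mu, \theta_\alpha^{n+1})$.

By definition, $\theta_\alpha^n$ is a decreasing function of n; hence $\theta_\alpha^{n+1} \le \theta_\alpha^n$, and
$\eta_{2k+2}(x_\mu, \theta_\alpha^{n+1}) \le \eta_{2k+2}(x_\mu, \theta_\alpha^n)$. Thus for each $z \in C$, $\rho(x_\mu, z) < \eta_{2k+2}(x_\mu, \theta_\alpha^n)$.
It follows that for each of the at most 2k + 2 points x of $v^n \cup v^{n+1}$,
C is contained in the $\eta_{2k+2}(x, \theta_\alpha^n)$ neighborhood of x. Hence by (9.3), the
diameter of the convex hull of $C \cup v^n \cup v^{n+1}$ in $\bar e_\gamma$ is less than $2\theta_\alpha^n$.

Let $w^n = u^n - v^n$, and let $w^{n+1} = u^{n+1} - v^{n+1}$. Then for each $x_\lambda \in w^n \cup w^{n+1}$,
$e(x_\lambda) < e_\alpha$. Let $x_\pi$ be chosen in $w^n \cup w^{n+1}$ so that $e(x_\pi)$ has maximum
dimension, and let $e_\beta = e(x_\pi)$. Then if $x_\lambda \in w^n \cup w^{n+1}$, $U_\lambda \cap U_\pi \ne 0$, and hence
by 3'') applied to the faces $e(x_\pi)$ and $e(x_\lambda)$ of $e_\alpha$, either $e(x_\lambda) > e(x_\pi)$ or
$e(x_\lambda) \le e(x_\pi)$. Since $e(x_\pi)$ is of maximal dimension, $e(x_\lambda) > e(x_\pi)$ is
impossible; hence $e(x_\lambda) \le e(x_\pi)$, and $x_\lambda \in \bar e(x_\pi) = \bar e_\beta$. Therefore $w^n \cup w^{n+1} \subset \bar e_\beta$.

Let $C' = (C \cup v^n \cup v^{n+1})^*$, the convex hull of $C \cup v^n \cup v^{n+1}$ in $\bar e_\gamma$. Then
diam $C' < 2\theta_\alpha^n \le 2\rho_\beta^n$. Now $C' \subset \bar e_\gamma$, $e_\beta < e_\gamma$, and y is in C' and in each of
the sets $U_\lambda$ of $\mathfrak{U}_\alpha^n \cup \mathfrak{U}_\alpha^{n+1}$ for which $x_\lambda \in w^n \cup w^{n+1}$. But by
condition 5'), each such set $U_\lambda$ is in $\mathfrak{U}_\beta^n \cup \mathfrak{U}_\beta^{n+1}$. Thus we can apply condition 6)
to $e_\beta$, and we find that $(C' \cup w^n \cup w^{n+1})^*$ has diameter less than 1/n. But

$(C' \cup w^n \cup w^{n+1})^* = ((C \cup v^n \cup v^{n+1})^* \cup w^n \cup w^{n+1})^*$
$= (C \cup v^n \cup v^{n+1} \cup w^n \cup w^{n+1})^* = (C \cup u^n \cup u^{n+1})^*$.

Therefore $(C \cup u^n \cup u^{n+1})^* = (C \cup \{x_{\lambda_1}, \cdots, x_{\mu_s}\})^*$ has diameter less than
1/n, and condition 6) is satisfied.

Thus for each $e_\alpha$ of dimension k, we construct $\mathfrak{U}_\alpha^n$, $\{x_\lambda\}$, and $\rho_\alpha^n$ satisfying
conditions 2) to 6). By induction on k, we have a family of $\mathfrak{U}_\alpha^n$, $x_\lambda$ and $\rho_\alpha^n$
satisfying conditions 2) to 6) for all $e_\alpha$ and all n. Let $\mathfrak{U}^n = \bigcup_\alpha \mathfrak{U}_\alpha^n$. Then
all the conditions 1) to 6) are satisfied.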




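13. The families of open sets as coverings. We first verify that each
family $\mathfrak{U}^n$ is a covering and in fact a locally finite covering of the metric
complex K.


(13.1) Each of the families $\mathfrak{U}^n$, n = 1, 2, ···, of open sets is a
locally finite covering of K.


Proof. Let $x \in K$, and let $e_\alpha = e(x)$. Then by condition 2), x is
contained in some set of $\mathfrak{U}_\alpha^n \subset \mathfrak{U}^n$. Therefore $\mathfrak{U}^n$ is a covering of K. Let $W_\alpha$
be the union of the open sets of $\mathfrak{U}_\alpha^n$; then $W_\alpha$ is an open set containing x.
Let a(x) be a positive number less than half the distance from x to the
complement of the open set $W_\alpha \cap \mathrm{St}\, e_\alpha$, and let G(x) be the spherical neighborhood
of x with radius a(x).


Let $U_\lambda \in \mathfrak{U}^n - \mathfrak{U}_\alpha^n$, and let $e_\gamma = e(x_\lambda)$. Then for some $e_\delta$, $U_\lambda \in \mathfrak{U}_\delta^n$, and
$e_\gamma \le e_\delta$. Hence by 5'), $U_\lambda \in \mathfrak{U}_\gamma^n$. Since $U_\lambda \notin \mathfrak{U}_\alpha^n$, $e_\gamma$ is not a face of $e_\alpha$, and
so $e_\alpha = e(x)$ is not in the star of $e_\gamma = e(x_\lambda)$. Hence $x \in K - \mathrm{St}\, e(x_\lambda)$.

Suppose it possible that $G(x) \cap U_\lambda \ne 0$, and let $y \in G(x) \cap U_\lambda$. Then
by 3), since $y \in U_\lambda \in \mathfrak{U}_\gamma^n$, and $x \in K - \mathrm{St}\, e(x_\lambda)$, $\rho(x_\lambda, y) < \tfrac{1}{2}\rho(x_\lambda, x)$. Also,
since $y \in G(x)$, $\rho(x, y) < a(x)$. Hence $\rho(x_\lambda, x) \le \rho(x_\lambda, y) + \rho(x, y) < \tfrac{1}{2}\rho(x_\lambda, x)$
$+ a(x)$. Therefore $\rho(x_\lambda, x) < 2a(x)$, and hence $x_\lambda \in W_\alpha \cap \mathrm{St}\, e_\alpha$. Thus for
some $U_\mu \in \mathfrak{U}_\alpha^n$, $x_\lambda \in U_\mu$, and $e_\alpha \le e(x_\lambda) = e_\gamma$. Hence by 5), $U_\lambda \in \mathfrak{U}_\alpha^n$, which
is absurd. Therefore $G(x) \cap U_\lambda = 0$. Thus G(x) meets only a finite number
of open sets of $\mathfrak{U}^n$, and $\mathfrak{U}^n$ is locally finite.


(13.2) For any point $x \in K$, let u be the set of all $x_\lambda$ for which
$x \in U_\lambda \in \mathfrak{U}^n$, and let u' be the set of all $x_\mu$ for which $x \in U_\mu \in \mathfrak{U}^{n+1}$. Then u
and u' are contained in $\bar e(x)$, and the convex hull of $\{x\} \cup u \cup u'$ in $\bar e(x)$
has diameter less than 1/n.


Proof. Let $e_\alpha = e(x)$, and let C be the set {x} consisting of one point.
Then C is a convex set in $\bar e_\alpha$, and diam $C = 0 < 2\rho_\alpha^n$. As in the proof of
(13.1), there is a neighborhood G(x) of x which meets no sets of $\mathfrak{U}^n - \mathfrak{U}_\alpha^n$.
Hence the sets $U_{\lambda_1}, \cdots, U_{\lambda_r}$ of $\mathfrak{U}^n$ which contain x are in $\mathfrak{U}_\alpha^n$, and
$u = \{x_{\lambda_1}, \cdots, x_{\lambda_r}\}$ is contained in $\bar e_\alpha$. Similarly, the sets $U_{\mu_1}, \cdots, U_{\mu_s}$ of
$\mathfrak{U}^{n+1}$ which contain x are in $\mathfrak{U}_\alpha^{n+1}$, and hence $u' = \{x_{\mu_1}, \cdots, x_{\mu_s}\}$ is contained
in $\bar e_\alpha$. Then by condition 6), the convex hull of $C \cup u \cup u' = \{x\} \cup u \cup u'$
in $\bar e_\alpha$ has diameter less than 1/n.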


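14. The mappings of K into K. Corresponding to each of the coverings
$\mathfrak{U}^n$ of K we define a mapping $\psi_n: N^n \to K$, where $N^n$ is the Whitehead nerve
of $\mathfrak{U}^n$. For each vertex $u_\lambda$ of the nerve $N^n$ there is a corresponding open set
$U_\lambda \in \mathfrak{U}^n$, and there is a point $x_\lambda$ of K associated with $U_\lambda$; we define $\psi_n(u_\lambda) = x_\lambda$.
For each simplex $\sigma = u_{\lambda_0} \cdots u_{\lambda_p}$ of $N^n$, there is some point $x \in K$ contained
in the intersection $U_{\lambda_0} \cap \cdots \cap U_{\lambda_p}$. Among the cells of K which contain
at least one of the points $x_{\lambda_0}, \cdots, x_{\lambda_p}$, let $e_\tau$ be one of maximum dimension.
Then it follows from condition 3'') that $e_\tau$ is unique, $x_{\lambda_0}, \cdots, x_{\lambda_p}$ are all
in $\bar e_\tau$, and by condition 3'), $e_\tau \le e(x)$. We define $\psi_n | \bar\sigma$ to be the linear map
of $\bar\sigma$ into $\bar e_\tau$ determined by mapping each vertex $u_\lambda$ of $\sigma$ into the corresponding
$x_\lambda$. Since $e_\tau \le e(x)$, $\psi_n | \bar\sigma$ is also a linear map of $\bar\sigma$ into $\bar e(x)$ for each
$x \in U_{\lambda_0} \cap \cdots \cap U_{\lambda_p}$. If $\sigma_1$ is a face of $\sigma$, the point x is in each $U_\lambda$
corresponding to a vertex $u_\lambda$ of $\sigma_1$, and therefore $\psi_n | \bar\sigma_1$ is also a linear map into
$\bar e(x)$. Hence $\psi_n | \bar\sigma$ is a linear map into $\bar e(x)$, and $\psi_n | \bar\sigma$ is continuous. Since
$N^n$ has the Whitehead topology, $\psi_n: N^n \to K$ is therefore continuous.

Let $K_W$ be the complex K retopologized with the Whitehead topology,
and let $f: K_W \to K$ be the identity map; i.e. for each $x \in K_W$, let $f(x) = x \in K$.
Since the topology of $K_W$ is at least as fine as that of K, f is continuous.
The map $\psi_n: N^n \to K$ can be factored into a map $\chi_n: N^n \to K_W$ followed by
$f: K_W \to K$. Since $\chi_n$ is linear and hence continuous on each closed simplex
of $N^n$, $\chi_n: N^n \to K_W$ is continuous. Let $\phi_n$ be a canonical map of K into the
nerve $N^n$ of $\mathfrak{U}^n$. Then ([3], p. 202) for each $x \in K$, $\phi_n(x) \in \bar\sigma(x)$, where $\sigma(x)$
is the simplex of $N^n$ determined by x; the vertices of $\sigma(x)$ correspond to the
open sets of $\mathfrak{U}^n$ containing x. If $u_\lambda$ is a vertex of $\sigma(x)$, then $x \in U_\lambda$,
and hence $\psi_n$ maps $\bar\sigma(x)$ into $\bar e(x)$. Hence $\psi_n\phi_n(x) \in \bar e(x)$. Let $g_n = \chi_n\phi_n$:
$K \to K_W$; then (see diagram) $f g_n = f\chi_n\phi_n = \psi_n\phi_n: K \to K$. Thus, for each
$x \in K$, $f g_n(x) \in \bar e(x)$.


[Diagram: the maps $\phi_n: K \to N^n$, $\chi_n: N^n \to K_W$, $\psi_n: N^n \to K$, $g_n: K \to K_W$ and $f: K_W \to K$.]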


15. The homotopy. We now define a homotopy $h: K \times I \to K$. For
n = 1, 2, ···, let $h(x, 1/n) = f g_n(x) = \psi_n\phi_n(x)$. For $1/(n+1) \le t \le 1/n$,
let h(x, t) be the point of $\bar e(x)$ which divides the segment from $h(x, 1/(n+1))$
to $h(x, 1/n)$ in the ratio of $t - 1/(n+1)$ to $1/n - t$. Let h(x, 0) = x.
We must show that $h: K \times I \to K$ is continuous.
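
The piecewise linear definition of h in the variable t can be written out concretely. The sketch below is only an illustration under assumptions that are not in the paper: the closed cell is modelled as a convex subset of a coordinate space, points are tuples of exact rationals, and the callable `points` is a hypothetical stand-in returning the point $f g_n(x)$ for a fixed x.

```python
from fractions import Fraction
import math


def h(points, x, t):
    """Evaluate h(x, t) for a fixed point x.

    points: hypothetical callable n -> tuple of Fractions, standing in
            for f g_n(x) = h(x, 1/n).
    x:      tuple of Fractions, the value h(x, 0).
    t:      Fraction with 0 <= t <= 1.
    """
    if t == 0:
        return x
    n = math.floor(1 / t)              # 1/(n+1) < t <= 1/n
    lo, hi = Fraction(1, n + 1), Fraction(1, n)
    # The point divides the segment from h(x, 1/(n+1)) to h(x, 1/n)
    # in the ratio (t - lo) : (hi - t), so the weight of h(x, 1/n) is w.
    w = (t - lo) / (hi - lo)
    p_next, p_n = points(n + 1), points(n)
    return tuple((1 - w) * a + w * b for a, b in zip(p_next, p_n))


# Assumed sample data: the points f g_n(x) approach x = (0, 0).
def points(n):
    return (Fraction(1, n), Fraction(0))


x = (Fraction(0), Fraction(0))
print(h(points, x, Fraction(1, 2)), h(points, x, Fraction(5, 12)))
```

At t = 1/n the weight w equals 1 and the value is `points(n)`, so the pieces on adjacent intervals agree at their common endpoints; continuity at t = 0 is the part that requires the diameter estimate (13.2).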




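Let $x \in K$, and let $e_\alpha = e(x)$. Let V be an open set containing x which
meets no sets of $\mathfrak{U}^n - \mathfrak{U}_\alpha^n$ and meets no sets of $\mathfrak{U}^{n+1} - \mathfrak{U}_\alpha^{n+1}$; then $V \subset \mathrm{St}\, e_\alpha$.
Let $I_n$ be the interval $1/(n+1) \le t \le 1/n$. Let $(y, t) \in V \times I_n$. Then since
$y \in V \subset \mathrm{St}\, e_\alpha$, $e_\alpha \le e(y)$. If $y \in U_\lambda \in \mathfrak{U}^n$, we have $U_\lambda \in \mathfrak{U}_\alpha^n$, and hence $x_\lambda \in \bar e_\alpha$.
Thus for each vertex $u_\lambda$ of $\sigma(y)$ in $N^n$, $\psi_n(u_\lambda) \in \bar e_\alpha$; hence $\psi_n$ maps $\bar\sigma(y)$ into
$\bar e_\alpha \subset \bar e(y)$. Therefore, since $\phi_n(y) \in \bar\sigma(y)$, $f g_n(y) = \psi_n\phi_n(y) \in \bar e_\alpha$. Similarly
$f g_{n+1}(y) \in \bar e_\alpha$. Since $e_\alpha \le e(y)$, the segment joining $f g_{n+1}(y)$ to $f g_n(y)$ in $\bar e(y)$
is the segment joining them in $\bar e_\alpha$, and the point h(y, t) is the point dividing
the segment from $f g_{n+1}(y)$ to $f g_n(y)$ in $\bar e_\alpha$ in the ratio $t - 1/(n+1)$ to
$1/n - t$. Thus h maps $V \times I_n$ continuously into $\bar e_\alpha \subset K$. It follows that
for each n, h maps $K \times I_n$ continuously, and from this it follows that h is
continuous except possibly for t = 0.

We must show that h is also continuous at points (x, 0) of $K \times I$. For
any $x \in K$ and $\epsilon > 0$, let U be the ($\epsilon/2$)-neighborhood of x in K, and let W
be the subset $\{t \mid 0 \le t < 1/m\}$ of I, where m is some integer greater than $2/\epsilon$.
Then $U \times W$ is a neighborhood of (x, 0) in $K \times I$. Let (y, t) be any
point of $U \times W$. If t = 0, h(y, t) = y, h(x, 0) = x, and $\rho(h(x, 0), h(y, t))$
$= \rho(x, y) < \epsilon/2 < \epsilon$. If, on the other hand, t > 0, then for some $n \ge m$,
$1/(n+1) \le t \le 1/n$. If $U_{\lambda_1}, \cdots, U_{\lambda_r}$ are the sets of $\mathfrak{U}^n$ containing y, then
$\phi_n(y) \in \bar\sigma(y)$, where $\sigma(y)$ is the simplex $u_{\lambda_1} \cdots u_{\lambda_r}$ of $N^n$. Since $\psi_n(u_{\lambda_i}) = x_{\lambda_i}$,
and since $\psi_n | \bar\sigma(y)$ is a linear map into $\bar e(y)$, $\psi_n$ maps $\bar\sigma(y)$ into the convex
hull of $\{x_{\lambda_1}, \cdots, x_{\lambda_r}\}$ in $\bar e(y)$. Similarly if $U_{\mu_1}, \cdots, U_{\mu_s}$ are the sets of $\mathfrak{U}^{n+1}$
containing y, $\psi_{n+1}\phi_{n+1}(y)$ is in the convex hull of $\{x_{\mu_1}, \cdots, x_{\mu_s}\}$ in $\bar e(y)$. Since
h(y, t) is on the segment from $\psi_{n+1}\phi_{n+1}(y)$ to $\psi_n\phi_n(y)$ in $\bar e(y)$, $h(y, t) \in$
$\{x_{\lambda_1}, \cdots, x_{\mu_s}\}^*$ in $\bar e(y)$. By (13.2), diam $\{y, x_{\lambda_1}, \cdots, x_{\mu_s}\}^* < 1/n$; hence
$\rho(y, h(y, t)) < 1/n \le 1/m < \epsilon/2$. Because $y \in U$, $\rho(x, y) < \epsilon/2$. Hence,
since x = h(x, 0), $\rho(h(x, 0), h(y, t)) < \epsilon/2 + \epsilon/2 = \epsilon$. Thus h is continuous
at (x, 0), and therefore is continuous.

We have shown that $h: K \times I \to K$ is a homotopy.¹⁰ Since h(x, 0) = x,
and $h(x, 1) = f g_1(x)$, we have


(15.1) The map $f g_1: K \to K$ is homotopic to the identity.


We have in fact proved much more. We have shown that if $0 \le t \le 1/n$,
then $\rho(y, h(y, t)) < 1/n$. Hence if $0 \le t \le 1$, $\rho(y, h(y, t/n)) < 1/n$. Thus
if we set $h_n(x, t) = h(x, t/n)$, we have $h_n(x, 0) = h(x, 0) = x$ and $h_n(x, 1)$
$= h(x, 1/n) = \psi_n\phi_n(x)$, and therefore $h_n$ is a homotopy of the identity


¹⁰ Actually the homotopy is a uniform homotopy. For definition of uniform
homotopy see [3], p. 204.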




map of K in K to the map $\psi_n\phi_n$ such that, for each $x \in K$ and $t \in I$,
$\rho(x, h_n(x, t)) < 1/n$. Thus if we are given $\epsilon > 0$, and if we choose $n > 1/\epsilon$,
then for every $t \in I$, $h_n(x, t)$ is within $\epsilon$ of x. We have therefore obtained


(15.2)¹¹ Given any positive number $\epsilon$, the identity map of the metric
complex K on itself is $\epsilon$-homotopic to a factored map $\psi_n\phi_n$, where $\phi_n$ is a
map of K into a Whitehead complex $N^n$, and $\psi_n$ is a map of $N^n$ into K.
During the homotopy a point x does not leave the closure $\bar e(x)$ of the cell
containing it.


16. The homotopy type. Two spaces XY and Y are said to have the 
same homotopy type if there exist maps f:X > Y and g: Y-—¥X such that 
fg: Y — Y is homotopic to the identity, and gf: X +X is homotopic to the 
identity. Spaces of the same homotopy type can not be distinguished by any 
of the invariants of algebraic topology. Our main theorem is that a metric 
complex and the corresponding Whitehead complex have the same homotopy 
type. 


THEOREM 1. If K is a metric’? complex and Ky ts the complex re- 
topologized with the Whitehead topology, then K and Kw have the same 
homotopy type. 


Proof. We have maps $f: K_W \to K$ and $g_1: K \to K_W$ such that, by (15.1),
$fg_1: K \to K$ is homotopic to the identity. It is then sufficient to show that
$g_1 f: K_W \to K_W$ is homotopic to the identity. We define the homotopy
$h: K_W \times I \to K_W$ as follows. For each $x \in K_W$, $g_1 f(x) = g_1(x) \in \bar e(x)$; we
define $h(x,t)$ to be the point which divides the segment from $x$ to $g_1 f(x)$ in
$\bar e(x)$ in the ratio $t : 1-t$. For each cell $e$ of $K_W$, if $x \in \bar e$, then $\bar e(x) \subset \bar e$
and hence $h(x,t)$ is the point dividing the segment from $x$ to $g_1 f(x)$ in $\bar e$
in the ratio $t : (1-t)$. Thus $h \mid \bar e \times I$ is continuous, and by (5.2),
$h: K_W \times I \to K_W$ is a homotopy.
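
The homotopy used in this proof can be written as an explicit formula. The following is only a sketch: it assumes that each closed cell carries an affine (for instance barycentric) structure in which segments are formed, and the convex-combination notation is introduced here for illustration rather than taken from the paper.

```latex
% Point dividing the segment from x to g_1 f(x) in the ratio t : (1 - t),
% formed inside the closed cell \bar e(x) (affine structure assumed).
\[
  h(x,t) = (1-t)\,x + t\,g_1 f(x), \qquad x \in K_W,\ 0 \le t \le 1,
\]
% The two ends of the homotopy: the identity and the map g_1 f.
\[
  h(x,0) = x, \qquad h(x,1) = g_1 f(x).
\]
```

Since both $x$ and $g_1 f(x)$ lie in $\bar e(x)$, the whole segment stays in that closed cell, which is the property the continuity argument relies on.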


THEOREM 2. Isomorphic metric complexes have the same homotopy type. 


Proof. Let $K$ and $L$ be isomorphic metric complexes. Then $K_W$ and
$L_W$ are isomorphic Whitehead complexes. Hence by (6.2), $K_W$ and $L_W$ are
homeomorphic and a fortiori of the same homotopy type. By Theorem 1,


¹¹ In the terminology of Lefschetz ([9], p. 98), (15.2) says that the identity
mapping of $K$ on itself is $\epsilon$-deformable, for all $\epsilon$, to a mapping into a continuous complex.
We may interpret this as meaning that $K$ is an absolute neighborhood retract.

12 In this and the following theorems it is sufficient to assume that K is a metrizable 
complex. 




K has the same homotopy type as Kw, and L has the same homotopy type as 
Lw. Hence K has the same homotopy type as L. 


17. Canonical mappings. We now show that the theorems on canonical
mappings¹³ into the nerve of a covering hold equally whether the nerve is
provided with a metric or with the Whitehead topology.


THEOREM 3. Let $X$ be a topological space, let $U$ be a covering of $X$,
let $N$ be the nerve of $U$ with the Whitehead topology, and let $M$ be the nerve
metrized in some way to form a metric complex. Then there exists a canonical
map of $X$ into the nerve $N$ of $U$ if and only if there is a canonical map of $X$
into the nerve $M$ of $U$.


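Proof. First let $\phi$ be a canonical map of $X$ into $N$, and let $f$ be the
identity map of $N$ ($= M_W$) onto $M$. Then $f\phi: X \to M$ is continuous. Also,
for each vertex $u_\alpha$ of $M$, $(f\phi)^{-1}\,\mathrm{St}_M\,u_\alpha = \phi^{-1}\,\mathrm{St}_N\,u_\alpha \subset U_\alpha$. Hence $f\phi$ is a
canonical map of $X$ into $M$.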


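On the other hand, let $\psi$ be a canonical map of $X$ into $M$. Let $g$ be a
map of $M$ into $N$ ($= M_W$) (see § 14) such that, for each $x \in M$, $g(x) \in \bar e(x)$
in $N$. Then $g\psi$ is a map of $X$ into $N$. If $g(x) \in e_2$, $x \in \mathrm{St}\,e_2$; hence
$g^{-1}e_2 \subset \mathrm{St}\,e_2$. If $e_1 \subset \mathrm{St}\,e_2$, $g^{-1}e_1 \subset \mathrm{St}\,e_1 \subset \mathrm{St}\,e_2$. Hence $g^{-1}\,\mathrm{St}\,e_2 \subset \mathrm{St}\,e_2$.
It follows that $(g\psi)^{-1}\,\mathrm{St}_N\,u_\alpha = \psi^{-1}g^{-1}\,\mathrm{St}_N\,u_\alpha \subset \psi^{-1}\,\mathrm{St}_M\,u_\alpha \subset U_\alpha$. Therefore $g\psi$
is a canonical map of $X$ into $N$.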


COROLLARY 1. Let $X$ be a normal space, and let $U$ be a covering of $X$.
Then there is a canonical map of $X$ into a metric nerve $M$ of $U$, or into the
Whitehead nerve $N$ of $U$, if and only if $U$ has a locally finite refinement.


COROLLARY 2. Let $X$ be a topological space. There is a canonical map
of $X$ into a metric nerve $M$ or into the Whitehead nerve $N$ of every covering
$U$ of $X$ if and only if $X$ is paracompact and normal.


Proof. These results have been proved for particular metric nerves ([4],
p. 388). It follows from Theorem 3 that they hold for Whitehead nerves.
By another application of Theorem 3 they hold for any other metric nerves.


18. Topological invariance. Finally we show the topological
invariance of the homology and cohomology groups of metric complexes.


THEOREM 4. The combinatorial homology and cohomology groups of a 


¹³ For definition of canonical mappings see section 7 above.




metric complex are isomorphic with the corresponding singular homology 
groups. If the coefficient group is discrete, the combinatorial cohomology 
groups are isomorphic with the Cech cohomology groups. 


Proof. The singular homology and cohomology groups and the Cech 
cohomology groups are invariants of the homotopy type ([7], p. 400; [5], 
p. 28%). Hence by Theorem 1 they are the same for K and for Kw. By 
(8.1) and (8.2) they are therefore isomorphic with the corresponding com- 
binatorial homology and cohomology groups. 


COROLLARY. If $K$ and $L$ are homeomorphic metric complexes, their
combinatorial homology and cohomology groups are isomorphic.


HARVARD UNIVERSITY. 


BIBLIOGRAPHY 


[1] P. Alexandroff and H. Hopf, Topologie, Berlin (1935).
[2] N. Bourbaki, Éléments de Mathématique II, (Actualités scientifiques et industrielles, No. 858), Paris (1940).
[3] C. H. Dowker, “Mapping theorems for non-compact spaces,” American Journal of Mathematics, vol. 69 (1947), pp. 200-242.
[4] ---, “An extension of Alexandroff’s mapping theorem,” Bulletin of the American Mathematical Society, vol. 54 (1948), pp. 386-391.
[5] ---, “Čech cohomology theory and the axioms,” Annals of Mathematics, vol. 51 (1950), pp. 276-292.
[6] S. Eilenberg, “Singular homology theory,” Annals of Mathematics, vol. 45 (1944), pp. 407-447.
[7] W. Hurewicz, J. Dugundji and C. H. Dowker, “Continuous connectivity groups in terms of limit groups,” Annals of Mathematics, vol. 49 (1948), pp. 391-406.
[8] S. Lefschetz, Algebraic Topology, New York (1942).
[9] ---, Topics in Topology, Princeton (1942).
[10] K. Reidemeister, Topologie der Polyeder, Leipzig (1938).
[11] J. H. C. Whitehead, “Simplicial spaces, nuclei and m-groups,” Proceedings of the London Mathematical Society (2), vol. 45 (1939), pp. 243-327.
[12] ---, “Combinatorial homotopy I,” Bulletin of the American Mathematical Society, vol. 55 (1949), pp. 213-245.




ON THE UNBOUNDEDNESS OF THE ESSENTIAL SPECTRUM.* 


By C. R. Putnam. 


1. In the differential equation

(1) $x'' + (\lambda - f)x = 0$,

let $\lambda$ denote a real parameter, and let $f = f(t)$ be a real-valued continuous
function on the half-line $0 \le t < \infty$. (Throughout this paper only real-valued
functions will be considered.) In addition, suppose that $f$ is such that the
differential equation (1) is of the Grenzpunkt type, so that (1) possesses for
some $\lambda$ (and hence for every $\lambda$) a solution $x$ which fails to belong to the class
$L^2 = L^2[0,\infty)$; cf. Weyl [11], p. 238. In this case, the equation (1) and a
linear, homogeneous boundary condition


(2) $x(0)\cos\alpha + x'(0)\sin\alpha = 0$


determine a boundary value problem on $0 \le t < \infty$ with a spectrum $S_\alpha$. If
$S'$ denotes the (closed, possibly empty) set of cluster points of $S_\alpha$, then $S'$ is
independent of $\alpha$ ([11], p. 251), and is called the essential spectrum of (1).

Various results concerning the set S’ are known in case f is subject to 
certain restrictions. If, for instance, f satisfies 


(3) f(t) as 


then S’ is the half-line ct <0; [3]. In general, the complement of 8’ 
is an open (possibly empty) set, and hence possesses a decomposition $\sum (\lambda_k, \lambda^k)$
into open intervals $\lambda_k < \lambda < \lambda^k$ (or “gaps” in $S'$), where it is understood
that one, or possibly two, of the intervals are half-lines, and that the summation
may consist of the single interval $-\infty < \lambda < \infty$ in case $S'$ is empty.
If $f$ is bounded, so that

(4) $|f(t)| < \text{const.}$, $0 \le t < \infty$,

it is known that, except for $(\lambda_0, \lambda^0) = (-\infty, \lambda^0)$, the inequality $\lambda^k - \lambda_k \le \text{const.}$
holds, so that, in particular, $S'$ is unbounded from above ([8]), and if certain
extra conditions are placed on $f$, even asymptotic estimates of the gaps in $S'$
can be given; cf. [6].


* Received September 15, 1951. 




In most instances, the set S’ is, if it is not empty, unbounded either 
from above or from below. However, examples of functions f for which S’ 
consists of a single point are known; [4], Corollary 3, p. 110. Thus the 
set S’ can be non-empty and yet bounded. 

It is natural to ask under what general condition on $f$ the set $S'$ is
unbounded when it is not empty. It turns out that a sufficient condition is
that $f$ be bounded from below; thus, $f$ should satisfy

(5) $f(t) > \text{const.}$, $0 \le t < \infty$.

(It should be noticed that (5), and hence (4) or (3), imply that (1) is of
the Grenzpunkt type; [11], p. 238.) Furthermore, as will be shown in
Theorem I below, the restriction (5) even permits an asymptotic estimate of
the gaps in $S'$. For convenience in the sequel, the following terminology
will be introduced: Let $m_\alpha(\lambda) = \min|\lambda - \mu|$ when $\mu$ is in $S_\alpha$, and let
$m(\lambda) = \min|\lambda - \mu|$ when $\mu$ is in $S'$. Thus $m_\alpha(\lambda)$ and $m(\lambda)$ denote the
distance from a fixed value $\lambda$ to the nearest point of the set $S_\alpha$ or $S'$. Clearly
$m(\lambda) = \infty$ for some $\lambda$ (and hence for every $\lambda$) if and only if $S'$ is empty.
The following will be proved:


THEOREM I. Let $f(t)$ be a real-valued continuous function on $0 \le t < \infty$
satisfying condition (5). Then one of the following two possibilities must
occur: either (i) the set $S'$ is empty or (ii) $S'$ is not empty and is unbounded
from above. Furthermore, in case (ii), $m(\lambda)$ satisfies

(6) $m(\lambda) = O(\lambda^{1/2})$, $\lambda \to \infty$.

The main tool used in this paper will be the lemma (*) of section 2
from which certain estimates for $m(\lambda)$ will be derived; various consequences
of (*), in addition to Theorem I, will be set forth in Theorems II-IV in
section 4.


2. Consider the boundary value problem determined by (1) and (2)
for a fixed $\alpha$, and let $\lambda_1, \lambda_2, \dots$ denote the eigenvalues (if any) and
$\phi_1, \phi_2, \dots$ the corresponding normalized eigenfunctions. Then for $\lambda = \lambda_j$
and $x = \phi_j$, equation (1) becomes

(7) $\phi_j'' + (\lambda_j - f)\phi_j = 0$.

Let $L(x)$ be defined by $L(x) = x'' - fx$, and let $g$ denote any function of
class $L^2$ satisfying the boundary condition (2), for which $L(g)$ is defined,
continuous, and of class $L^2$; thus,


(8) $\int_0^\infty g^2\,dt < \infty$, $\int_0^\infty (L(g))^2\,dt < \infty$.




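Since $g$ satisfies (2), $\phi_j'(0)g(0) - \phi_j(0)g'(0) = 0$; in addition, by (8),
$\phi_j'(t)g(t) - \phi_j(t)g'(t) \to 0$ as $t \to \infty$ (cf. [11], pp. 241-242). Multiplication
of (7) by $g$ followed by an integration readily leads to

(9) $\int_0^\infty (L(g) + \lambda g)\phi_j\,dt = (\lambda - \lambda_j)\int_0^\infty g\phi_j\,dt$,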


for an arbitrary real number $\lambda$. Two applications of the Parseval relation
applied to the functions $L(g) + \lambda g$ and $g$ yield, in virtue of (9) and a similar
relation in which the $\phi_j$ are replaced by the eigendifferentials corresponding
to the continuous spectrum, the inequality

(10) $\int_0^\infty (L(g) + \lambda g)^2\,dt \ge m_\alpha^2(\lambda)\int_0^\infty g^2\,dt$;

cf. [9], p. 140, for calculations of a similar nature.
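
The step from (9) to (10) can be sketched as follows, under the simplifying assumption (made only for this illustration) that the spectrum $S_\alpha$ is a pure point spectrum, so that no eigendifferentials occur. The coefficient name $c_j$ is introduced here and does not appear in the paper; the `align*` environment assumes the amsmath package.

```latex
\begin{align*}
% Fourier coefficients of g with respect to the normalized eigenfunctions
c_j &= \int_0^\infty g\,\phi_j\,dt, \\
% by (9), the coefficients of L(g) + \lambda g are (\lambda - \lambda_j) c_j
\int_0^\infty (L(g)+\lambda g)\,\phi_j\,dt &= (\lambda-\lambda_j)\,c_j, \\
% Parseval for L(g) + \lambda g, then |\lambda - \lambda_j| \ge m_\alpha(\lambda),
% then Parseval for g
\int_0^\infty (L(g)+\lambda g)^2\,dt
  &= \sum_j (\lambda-\lambda_j)^2 c_j^2
   \ge m_\alpha^2(\lambda) \sum_j c_j^2
   = m_\alpha^2(\lambda) \int_0^\infty g^2\,dt .
\end{align*}
```

The only inequality used is that every eigenvalue $\lambda_j$ lies at distance at least $m_\alpha(\lambda)$ from $\lambda$.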


First suppose that $S'$ is not empty, so that $m(\lambda) < \infty$ (for all $\lambda$), and
suppose $m_\alpha(\lambda) < m(\lambda)$. Let $g = g_n$, $n = 1, 2, \dots$, denote any sequence of
functions of the type considered above in the derivation of (10). In addition,
suppose that

(11) $\int_0^T g_n^2\,dt \Big/ \int_0^\infty g_n^2\,dt \to 0$, $n \to \infty$,


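holds for every fixed $T$ satisfying $0 \le T < \infty$. Let $\epsilon$ denote a small positive
number, and consider the $\lambda$-interval $[\lambda - m(\lambda) + \epsilon, \lambda + m(\lambda) - \epsilon]$. On this
interval there exist at most a finite number of points $\lambda_1, \lambda_2, \dots, \lambda_N$ (eigenvalues)
in the spectrum $S_\alpha$. It follows from the Schwarz inequality and
(11) that

(12) $\sum_{j=1}^N \left(\int_0^\infty g_n\phi_j\,dt\right)^2 \Big/ \int_0^\infty g_n^2\,dt \to 0$, $n \to \infty$.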


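It is now an easy consequence of (12) that

$\limsup_{n\to\infty} \int_0^\infty (L(g_n) + \lambda g_n)^2\,dt \ge \limsup_{n\to\infty} \left[(m(\lambda) - \epsilon)^2 \int_0^\infty g_n^2\,dt\right]$

whenever $\epsilon < m(\lambda)$ and the functions $g = g_n$ satisfy (11). Since $\epsilon$ is arbitrary,
one can obtain even

(13) $\limsup_{n\to\infty} \int_0^\infty (L(g_n) + \lambda g_n)^2\,dt \ge \limsup_{n\to\infty} \left[m^2(\lambda) \int_0^\infty g_n^2\,dt\right]$.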


Next, let $y$ denote any function satisfying (2) for which $y$, $y'$, and $L(y)$
are continuous and belong to $L^2$. Furthermore, let $\lambda > 0$, and put $h = \cos(\lambda^{1/2}t)$
and $g = yh$. It is readily verified that $g\cos\alpha + g'\sin\alpha = h(y\cos\alpha + y'\sin\alpha)$




$+\, yh'\sin\alpha$, so that, since $y$ satisfies (2) and since $h' = -\lambda^{1/2}\sin(\lambda^{1/2}t)$, $g$
clearly satisfies (2). Furthermore, $L(g) + \lambda g = L(y)h + 2y'h'$, so that the
left side of (10) becomes


(14) $\int_0^\infty (L(y)h + 2y'h')^2\,dt$.
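
The identity $L(g) + \lambda g = L(y)h + 2y'h'$ behind (14) can be checked directly from the product rule. The following worked computation is a sketch that uses only $g = yh$, $h = \cos(\lambda^{1/2}t)$ and $L(x) = x'' - fx$; the `align*` environment assumes the amsmath package.

```latex
\begin{align*}
% h is a solution of h'' + \lambda h = 0
h'' &= -\lambda\cos(\lambda^{1/2}t) = -\lambda h, \\
% product rule applied twice to g = y h
g'' &= y''h + 2y'h' + yh'', \\
% collect the terms of g'' - f g + \lambda g
L(g) + \lambda g &= (y'' - fy)\,h + 2y'h' + y\,(h'' + \lambda h) \\
                 &= L(y)\,h + 2y'h'.
\end{align*}
```

The term $y(h'' + \lambda h)$ drops out precisely because the frequency of $h$ is matched to $\lambda$.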
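One readily verifies as a consequence of the equation

$\cos^2(\lambda^{1/2}t) = \tfrac12(1 + \cos(2\lambda^{1/2}t))$

and an integration by parts, that the integral on the right side of (10) is

(15) $\int_0^\infty y^2h^2\,dt = \tfrac12\int_0^\infty y^2\,dt - \tfrac12\lambda^{-1/2}\int_0^\infty yy'\sin(2\lambda^{1/2}t)\,dt$.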
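
The integration by parts leading to (15) can be written out as below. This is a sketch under the paper's hypotheses that $y$ and $y'$ belong to $L^2$ (so that $y(t) \to 0$ as $t \to \infty$); the `align*` environment assumes the amsmath package.

```latex
\begin{align*}
% half-angle formula applied to h^2 = cos^2(\lambda^{1/2} t)
\int_0^\infty y^2h^2\,dt
  &= \tfrac12\int_0^\infty y^2\,dt
   + \tfrac12\int_0^\infty y^2\cos(2\lambda^{1/2}t)\,dt, \\
% integrate by parts; the antiderivative of the cosine is
% sin(2\lambda^{1/2} t) / (2\lambda^{1/2})
\int_0^\infty y^2\cos(2\lambda^{1/2}t)\,dt
  &= \Bigl[\,y^2\,\frac{\sin(2\lambda^{1/2}t)}{2\lambda^{1/2}}\Bigr]_0^\infty
   - \lambda^{-1/2}\int_0^\infty yy'\sin(2\lambda^{1/2}t)\,dt .
\end{align*}
```

The bracketed term vanishes at $t = 0$ because the sine does, and at infinity because $y(t) \to 0$; substituting the second line into the first gives the two terms of (15).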


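(Use is made of the limit relation $y(t) \to 0$ as $t \to \infty$; this, in turn, is a
consequence of the fact that $y$ and $y'$ belong to $L^2$.) An application of the
Schwarz inequality to the second integral on the right side of equation (15)
shows that

(16) $\left|\int_0^\infty yy'\sin(2\lambda^{1/2}t)\,dt\right| \le \left(\int_0^\infty y^2\,dt\right)^{1/2}\left(\int_0^\infty y'^2\,dt\right)^{1/2}$.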


If use is made of the inequality $(a + b)^2 \le 2(a^2 + b^2)$, where $a$ and $b$ are
real, it is seen that (14) is majorized by

(17) $2\int_0^\infty (L(y))^2\,dt + 8\lambda\int_0^\infty y'^2\,dt$.


Furthermore, if y is normalized by 


(18) $\int_0^\infty y^2\,dt = 1$,


it is seen from (15) and (16) that 


(19) $\int_0^\infty y^2h^2\,dt \ge \tfrac12 - \tfrac12\lambda^{-1/2}\left(\int_0^\infty y'^2\,dt\right)^{1/2}$.


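Suppose now that $y = y_n$, $n = 1, 2, \dots$, denotes any sequence of functions,
satisfying the conditions imposed on $y$ above, and such that (11), in
which $g_n = y_n h$, is valid for every fixed $T$, where $0 \le T < \infty$. It then follows
from (13) and the results obtained above that

(20) $4\limsup_{n\to\infty} \int_0^\infty \left[(L(y_n))^2 + 4\lambda y_n'^2\right]dt \ge m^2(\lambda)\left[1 - \lambda^{-1/2}\liminf_{n\to\infty}\left(\int_0^\infty y_n'^2\,dt\right)^{1/2}\right]$.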




It is clear from (19) that the condition (11) surely holds if 


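(21) $0 < 1 - \lambda^{-1/2}\limsup_{n\to\infty}\left(\int_0^\infty y_n'^2\,dt\right)^{1/2}$, $\int_0^T y_n^2\,dt \to 0$ ($n \to \infty$).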


Furthermore, it is clear from the above discussion, that (20) is also valid
in case $S'$ is empty (so that $m(\lambda) = \infty$), with the understanding that the left
side of the inequality of (20) may be $\infty$. The results obtained thus far can
be summarized in the following lemma:


(*) Let $\{y_n\}$ denote any sequence of real-valued functions on $0 \le t < \infty$,
satisfying the boundary condition (2) for $x = y_n$, and the normalization
condition (18) for $y = y_n$, and which, in addition, are such that $y_n$, $y_n'$, and
$L(y_n)$ are continuous, belong to $L^2$, and the inequality and limit relation of
(21) are satisfied. Then the inequality (20) is valid for all $\lambda > 0$ (where
$m(\lambda) \le \infty$).


The above lemma will be used in the next section to prove Theorem I. 


3. Proof of Theorem I. It is sufficient to show in this case that if $S'$
is not empty, then (6) must hold. For it is an obvious consequence of (6)
that $S'$ must contain an infinity of points clustering at $\lambda = +\infty$ (and possibly
elsewhere). But if $S'$ is not empty, then $S'$ is translated by $c$ if $f(t)$ is
replaced by $f(t) + c$, for any constant $c$. Since (5) remains valid for $f(t) + c$
whenever it holds for $f(t)$, it can be supposed that $\lambda = 0$ belongs to the set $S'$.
Consider the fixed boundary value problem determined by (1) and (2) for
$\alpha = 0$. Suppose first that $\lambda = 0$ is a cluster point of eigenvalues $\lambda_n$ with
corresponding normalized eigenfunctions $\phi_n$, so that


(22) 
0 
It is clear that 


Furthermore, multiplication of the equation (7) by $\phi_n$ for $j = n$, followed by
an integration, yields


(24) $\int_0^T \phi_n'^2\,dt = \left[\phi_n\phi_n'\right]_0^T + \int_0^T (\lambda_n - f)\phi_n^2\,dt$,


for every $T$ satisfying $0 \le T < \infty$. However, it follows from (5) that
$-f < \text{const.}$, so that, since $\phi_n(0) = 0$,

$\int_0^T \phi_n'^2\,dt \le \phi_n(T)\phi_n'(T) + \text{const.}$




It is known that $\phi_n(T)\phi_n'(T) \to 0$ as $T \to \infty$ (see the proof of the theorem
of [12], p. 6; cf. also [2]) so that

(25) $\int_0^\infty \phi_n'^2\,dt \le \text{const.}$ (independent of $n$).

Furthermore, the limit relation $\phi_n(t) \to 0$ as $n \to \infty$ holds uniformly on every
finite $t$-interval $0 \le t \le T$; cf. [10], p. 269.¹ Hence, for sufficiently large $\lambda$,
it follows from (25), that (21) holds for $y_n = \phi_n$. Moreover, it can be
supposed that (for sufficiently large $\lambda$),


$\tfrac12 \le 1 - \lambda^{-1/2}\limsup_{n\to\infty}\left(\int_0^\infty \phi_n'^2\,dt\right)^{1/2}$.


The lemma (*) is clearly applicable and shows that (20) holds and hence, 
by (22)-(24), 
(26) $\text{const.}\,\lambda \ge \tfrac12 m^2(\lambda)$


for large $\lambda$. That is, (6) is satisfied and the proof of Theorem I is complete,
at least if $\lambda = 0$ is a cluster point of eigenvalues of the boundary value problem
corresponding to $\alpha = 0$. If this last condition fails to hold, then $\lambda = 0$ is in
the continuous spectrum of this boundary problem. (This alone would imply
that $S'$ is unbounded; [13].) It will be shown that again (6) must hold.
(The proof is essentially similar to that carried out in case $\lambda = 0$ is a cluster
point of eigenvalues.) One can obtain a sequence of functions $y_n$ of the type
considered in (*), so that, in particular, (25) holds if $\phi_n$ is replaced by $y_n$,
and such that


(27) $y_n'' + (\lambda_n - f)y_n = k_n$,


where the $k_n$ are continuous functions satisfying

(28) $\int_0^\infty k_n^2\,dt \to 0$, $n \to \infty$;


see the lemma of [10], p. 269. In addition (23) is valid, and the limit relation
$y_n(t) \to 0$, as $n \to \infty$, holds uniformly on every fixed $t$-interval $0 \le t \le T$;
loc. cit., p. 269. Finally, a relation of the type (25), but where $\phi_n$ is replaced
by $y_n$, follows from (5), (27) and (28). Hence (26) can again be obtained
and the proof of Theorem I is complete.


¹ Professor Wintner has pointed out to me that the passage occurring in [10], p. 267,
referring to a classical theorem in [1], p. 278, should state that the methods of [1]
imply that the class of an operator, on the $L^2$ space $0 \le t < \infty$, is unchanged by adding
to it a bounded operator. Cf. a corresponding remark in [7], § 7.




4. In this section, a number of additional consequences of the lemma 
(*) of section 2 will be derived. First, a slight generalization of Theorem I 
is contained in the following 


THEOREM II. The essential spectrum $S'$ is not empty and, in fact, (6) holds
when the assumption that $f$ satisfies (5) is replaced by the assumption that
(1) be of the Grenzpunkt type and that there exist a sequence of real-valued
functions $y_n$ satisfying the following three conditions:

(i) the $y_n$ possess continuous second derivatives and satisfy the boundary
condition (2) for a fixed $\alpha$;

(ii) $\int_0^\infty y_n^2\,dt = 1$, $\int_0^T y_n^2\,dt \to 0$ (as $n \to \infty$, for every fixed positive
number $T$);

(iii) $\int_0^\infty y_n'^2\,dt < \text{const.}$, $\int_0^\infty (L(y_n))^2\,dt < \text{const.}$


That Theorem II implies Theorem I is clear. In fact, the conditions
specified in II were obtained, in the proof of I, as a consequence of the assumption
that (5) holds and that $S'$ is not empty. The proof of II proceeds along
the same lines as that of I, and can therefore be omitted.


THEOREM III. If, in Theorem II, the first inequality in condition (iii)
is strengthened to

(29) $\int_0^\infty y_n'^2\,dt \to 0$,

then the assertion (6) of Theorem II can be improved to the statement

(30) $m(\lambda) = O(1)$, as $\lambda \to \infty$.

If, in addition to (29), the second inequality of condition (iii) is replaced by

(31) $\int_0^\infty (L(y_n))^2\,dt \to 0$,

then the assertion of Theorem II can be improved to

(32) the half-line $0 \le \lambda < \infty$ is contained in $S'$.

In order to prove Theorem III one need note only that (20) now yields

(33) $\text{const.} \ge m^2(\lambda)$ or $0 \ge m^2(\lambda)$,

according as (29) alone or both (29) and (31) are assumed. Since (33) is
equivalent to (30) or (32), the proof of Theorem III is complete.




THEOREM IV. Let $f(t)$ be a real-valued continuous function on
$0 \le t < \infty$ satisfying (5), and suppose that, as $t \to \infty$, the (finite) value
$\mu = \liminf f(t)$ belongs to the set $S'$. Then the set $S'$ is precisely the half-line
$\mu \le \lambda < \infty$.


That $S'$ is contained in the half-line $\mu \le \lambda < \infty$ is a consequence of the
fact that the least point of $S'$ is never less than $\mu$; cf. [5], p. 850. It remains
to show then that every $\lambda > \mu$ (hence every $\lambda \ge \mu$) is in $S'$. To this end,
as in the proof of Theorem I, it can be supposed that $f(t)$ is shifted, if necessary,
by a constant, so that $\mu = 0$. Let $\lambda > 0$. It will be shown that there exists
a sequence of functions $y_n$ satisfying the conditions of Theorem III sufficient
for the implication (32). For convenience it will be supposed that $\alpha = 0$
in (2) and that $\mu = 0$ is a cluster point of eigenvalues. (Note that the set $S'$
is independent of $\alpha$; moreover, the case in which $\mu = 0$ is not a cluster point
of eigenvalues, and hence is in the continuous spectrum, can be treated as in
the proof of Theorem I.) Again, one can obtain relation (23). It is clear
from (24), the fact that $\mu = 0$, and from the properties of the functions $\phi_n$
occurring in the proof of Theorem I that

$\limsup \int_0^\infty \phi_n'^2\,dt = \limsup\left(-\int_0^\infty f\phi_n^2\,dt\right) \le 0$, ($n \to \infty$),

as an improvement to (25). The last formula line, for $y_n = \phi_n$, of course
implies (29). It is now clear that the functions $y_n = \phi_n$ satisfy the conditions
of Theorem III guaranteeing the validity of (32). This completes the proof
of Theorem IV.


PURDUE UNIVERSITY. 


REFERENCES. 


[1] T. Carleman, Sur les équations intégrales singulières à noyau réel et symétrique, Uppsala, 1923.
[2] P. Hartman, “The L²-solutions of linear differential equations of second order,” Duke Mathematical Journal, vol. 14 (1947), pp. 323-326.
[3] ---, “On the spectra of slightly disturbed linear oscillators,” American Journal of Mathematics, vol. 71 (1949), pp. 71-79.
[4] ---, “Some examples in the theory of singular boundary value problems,” ibid., vol. 74 (1952), pp. 107-126.
[5] --- and C. R. Putnam, “The least cluster point of the spectrum of boundary value problems,” ibid., vol. 70 (1948), pp. 849-855.
[6] --- and C. R. Putnam, “The gaps in the essential spectra of wave equations,” ibid., vol. 72 (1950), pp. 849-862.
[7] --- and A. Wintner, “On perturbations of the continuous spectrum of the harmonic oscillator,” ibid., vol. 74 (1952), pp. 79-85.
[8] C. R. Putnam, “The cluster spectra of bounded potentials,” ibid., vol. 70 (1949), pp. 842-848.
[9] ---, “On isolated eigenfunctions associated with bounded potentials,” ibid., vol. 72 (1950), pp. 135-147.
[10] ---, “The comparison of spectra belonging to potentials with a bounded difference,” Duke Mathematical Journal, vol. 18 (1951), pp. 267-273.
[11] H. Weyl, “Ueber gewöhnliche Differentialgleichungen mit Singularitäten und die zugehörigen Entwicklungen willkürlicher Funktionen,” Mathematische Annalen, vol. 68 (1910), pp. 222-269.
[12] A. Wintner, “(L²)-connections between the potential and kinetic energies of linear systems,” American Journal of Mathematics, vol. 72 (1947), pp. 5-13.
[13] ---, “On Dirac’s theory of continuous spectra,” The Physical Review, vol. 73 (1948), pp. 781-785.


PROPERTIES OF CONFORMAL INVARIANTS.* 


By Vipar WoLonTIs.** 


I. Basic Properties of Extremal Distance. 


1. Definition of extremal distance.¹ Let $D$ be a region in the complex
plane, $E_1$ and $E_2$ two disjoint compact subsets of $D$, and $\Gamma$ the class of all
rectifiable curves in $D$ joining $E_1$ and $E_2$. Let $P$ be the set of non-negative
functions $\rho(z)$ on $D$ such that the integral

(1) $L_\rho(\gamma) = \int_\gamma \rho\,|dz|$

is defined for all $\rho \in P$ and every rectifiable curve $\gamma$ in $D$, and

(2) $A_\rho(D) = \iint_D \rho^2\,dx\,dy$, $z = x + iy$,

exists and is different from zero. We wish to determine $\rho \in P$ so that the ratio

(3) $[\inf_{\gamma \in \Gamma} L_\rho(\gamma)]^2 / A_\rho(D)$

be maximal. Since the existence of a maximum is not assured, we consider
more generally the finite or positively infinite quantity

(4) $\lambda_D(E_1, E_2) = \sup_{\rho \in P}\,[\inf_{\gamma \in \Gamma} L_\rho(\gamma)]^2 / A_\rho(D)$,

which we call the extremal distance between $E_1$ and $E_2$ with respect to the
region $D$. For later use we observe that, since the value of the quotient (3)
remains unchanged if $\rho$ is multiplied by a positive constant, and since the
extremal distance (4) is clearly never zero, we may restrict the class $P$ e.g. by

* Received July 10, 1951. 

** This paper includes the results found in my thesis, “ Properties of Conformal 
Invariants,” Harvard University, 1949. I wish to express my deep gratitude to Pro- 
fessor Lars V. Ahlfors for suggesting problems and methods, and for encouraging 
guidance and great personal interest. 

¹ For the material presented in sections 1-3 of this chapter I am indebted to
Professors Ahlfors and Beurling for their permission to consult an unpublished manuscript.
Compare also A. Beurling, Études sur un problème de majoration, Thèse,
Uppsala, 1933, and L. Ahlfors and A. Beurling, “Conformal invariants and function-theoretic
null-sets,” Acta Mathematica, vol. 83 (1950), pp. 101-129.




the requirement $L_\rho(\gamma) \ge 1$, for all $\rho$ and $\gamma$, in which case (4) takes the
simple form

(4') $\lambda_D(E_1, E_2) = \sup_\rho 1/A_\rho(D)$, $L_\rho(\gamma) \ge 1$.


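The extremal distance is a conformal invariant of the configuration
$(D, E_1, E_2)$, i.e. if $z^* = f(z)$ is a one-to-one conformal mapping of $D$ upon a
region $D^*$, taking $E_1$ to $E_1^*$ and $E_2$ to $E_2^*$, then

(5) $\lambda_{D^*}(E_1^*, E_2^*) = \lambda_D(E_1, E_2)$.

In fact, with $\rho^*(z^*) = \rho(z)/|f'(z)|$ we have

$L_{\rho^*}(\gamma^*) = \int_{\gamma^*} \rho^*\,|dz^*| = \int_\gamma \rho\,|dz| = L_\rho(\gamma)$,

$A_{\rho^*}(D^*) = \iint_{D^*} \rho^{*2}\,dx^*\,dy^* = \iint_D \rho^2\,dx\,dy = A_\rho(D)$.

The definition of extremal distance remains meaningful if we allow $E_1$
and $E_2$ to contain accessible boundary points of $D$.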


2. The extension principle. The following simple principle is an
important tool in dealing with extremal distances: If $D^*$ is a region containing
$D$, and if $E_1^*$ and $E_2^*$ are compact subsets of $D^*$ containing $E_1$ and $E_2$
respectively, then

(6) $\lambda_{D^*}(E_1^*, E_2^*) \le \lambda_D(E_1, E_2)$.

The proof is immediate: Any $\rho(z)$ defined on $D^*$ is also defined on $D$,

(7) $A_\rho(D) \le A_\rho(D^*)$,

and, since any curve joining $E_1$ and $E_2$ will a fortiori belong to the class $\Gamma^*$
of curves joining $E_1^*$ and $E_2^*$,

(8) $\inf L_\rho(\gamma) \ge \inf L_\rho(\gamma^*)$, where $\gamma \in \Gamma$, $\gamma^* \in \Gamma^*$.

Inserted into the definition (4), these inequalities yield (6).
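
The way (7) and (8) combine in the definition (4) can be spelled out as a chain of inequalities. This is a sketch: the suprema are taken over functions $\rho$ defined on $D^*$, and it is assumed for the illustration that the restriction of each such $\rho$ to $D$ still has $A_\rho(D) \ne 0$, so that it is admissible for $D$.

```latex
% numerator decreases by (8), denominator increases by (7);
% the last step holds because restrictions to D form a subclass of P for D
\[
  \lambda_{D^*}(E_1^*,E_2^*)
  = \sup_{\rho}\frac{\bigl[\inf_{\gamma^*\in\Gamma^*} L_\rho(\gamma^*)\bigr]^2}{A_\rho(D^*)}
  \le \sup_{\rho}\frac{\bigl[\inf_{\gamma\in\Gamma} L_\rho(\gamma)\bigr]^2}{A_\rho(D)}
  \le \lambda_D(E_1,E_2).
\]
```

In words, enlarging the region or the sets can only shorten the extremal distance.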
In this connection we observe that if $E_1'$ and $E_2'$ denote the boundaries²
of $E_1$ and $E_2$, we have $E_1' \subset E_1$, $E_2' \subset E_2$, and (6) gives

(9) $\lambda_D(E_1, E_2) \le \lambda_D(E_1', E_2')$.


² Throughout this paper primed letters will denote the boundaries of the respective
sets.




On the other hand every curve joining $E_1$ and $E_2$ will also join their boundaries,
which implies, in analogy with (8), that

(10) $\lambda_D(E_1, E_2) \ge \lambda_D(E_1', E_2')$.

Combining (9) and (10) we have

(11) $\lambda_D(E_1, E_2) = \lambda_D(E_1', E_2')$,

i.e. for purposes of finding the extremal distance we may replace $E_1$ and $E_2$
by their boundaries.


3. Determination of a maximal $\rho$-function. With certain restrictions
on $E_1$, $E_2$, and $D$, necessary for the application of the classical existence
theorems on harmonic functions, we will now determine a function $\rho \in P$
for which the quantity (3) actually attains a maximum. Suppose
$D^* = D \cap C(E_1 \cup E_2)$ is a region whose boundary consists of a finite number
of analytic curves. Then it is known that there exists a function $u(x, y)$
harmonic in D* with the boundary values 1 on F,’, 0 on £,’ and normal 
derivative zero on D’. We assert that 


(12) $\rho_0(z) = (u_x^2 + u_y^2)^{1/2} = |\operatorname{grad} u|$

maximizes (3).

To prove this we observe that all but a finite number of the level curves
of the conjugate harmonic function $v$ of $u$ must join $E_1'$ and $E_2'$. In fact,
grad $u$ can vanish only at a finite number of points, and $u$ is monotonic on
any curve $v = \text{const.}$ not containing such a point, which implies that the
curve cannot be closed nor have both endpoints on $E_1'$ or both on $E_2'$. It can
of course not continue endlessly in the interior of $D^*$, since the existence of an
accumulation point would then yield $v = \text{const.}$ Also, a level curve of $v$ with
grad $u \ne 0$ is disjoint from $D'$, since $\partial u/\partial n$ was assumed to vanish there.

Now let $\rho(z)$ be any member of the class $P$, normalized as in (4').
Integrating along any level curve of $v$ joining $E_1'$ and $E_2'$ we have

(13) $\int \rho/\rho_0\,du = \int \rho/(\partial u/\partial s)\,du = \int \rho\,|dz| \ge 1 = \int du$;


hence, since the number of exceptional curves is finite,

(14) $\iint_{D^*} \rho/\rho_0\,du\,dv \ge \iint_{D^*} du\,dv$.

But from this it follows that

(15) $\iint_{D^*} (\rho/\rho_0 - 1)^2\,du\,dv \le \iint_{D^*} \rho^2/\rho_0^2\,du\,dv - \iint_{D^*} du\,dv$,




i.e. expressed in $x$ and $y$, $\rho_0^2$ being the Jacobian,

(16) $0 \le \iint_{D^*} (\rho - \rho_0)^2\,dx\,dy \le A_\rho(D^*) - A_{\rho_0}(D^*)$.

On account of the form (4') of the definition of extremal distance, and (11),
this means that

(17) $\lambda_D(E_1, E_2) = \lambda_{D^*}(E_1', E_2') = 1/A_{\rho_0}(D^*) = 1\Big/\iint_{D^*} (u_x^2 + u_y^2)\,dx\,dy = 1/D(u)$.

Here and below $D(u)$ denotes the Dirichlet integral taken over
$D^* = D \cap C(E_1 \cup E_2)$.


This completes the proof of (12). 

We may observe that in the restricted case considered in this section,
and actually by a suitable limit process in a more general case, the relation
(17) could be used as the definition of extremal distance. In view of the
electrostatic interpretation, $\lambda$ could then be called the resistance between $E_1$
and $E_2$.
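
As an illustration of the relation (17), consider a rectangle with two opposite sides as the sets. This worked example is not in the paper; it assumes that (17) may be applied when $E_1$ and $E_2$ are boundary arcs (compare the remark at the end of section 1) and when the boundary is only piecewise analytic. The side lengths $a$ and $b$ are names chosen for the illustration; the `align*` environment assumes the amsmath package.

```latex
\begin{align*}
% rectangle with the two vertical sides as E_1 and E_2
D &= \{\,0 < x < a,\ 0 < y < b\,\}, \qquad
E_1 = \{x = 0\}, \qquad E_2 = \{x = a\}, \\
% harmonic, values 0 and 1 on the two vertical sides,
% normal derivative zero on the horizontal sides
u(x,y) &= x/a, \qquad \rho_0 = |\operatorname{grad} u| = 1/a, \\
% Dirichlet integral over a region of area ab
D(u) &= \iint_D (u_x^2 + u_y^2)\,dx\,dy = \frac{1}{a^2}\cdot ab = \frac{b}{a}, \\
\lambda_D(E_1,E_2) &= 1/D(u) = a/b .
\end{align*}
```

The value $a/b$ grows with the separation of the two sides and shrinks with their length, in line with the resistance interpretation.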

It is easy to extend the validity of (12) to the case where $E_1$, $E_2$, and $D$
are still bounded by a finite number of analytic curves, but $D^*$ is not connected.
The set $D^*$ will then be the sum of a finite number $n$ of regions $D_i$.
For each one of those components $D_i$, say $D_1, \dots, D_m$, $m \le n$, whose boundary
contains both a part $E_1^i$ of $E_1$ and a part $E_2^i$ of $E_2$, there exists an extremal
distance for which we obtain by (4')


1/Ap, (£1, = 1/Ap,( £1‘, = inf Ap, (Di), Lo, (yi) = 1. 
pi 


The remaining $D_{m+1}, \dots, D_n$ have no influence upon the extremal distance
between $E_1$ and $E_2$, since no curve $\gamma$ will pass through them. Setting $\rho = \rho_i$
in $D_i$ for $i \le m$ and $\rho = 0$ in $D_i$ for $i > m$ we have $L_\rho(\gamma) \ge 1$; hence


(18) (Fi, E,) = inf Ap(D) 
p 
— int 1/ro, (Bit, Est) = 1/ro( Est, 
=1 pt 4=1 i=1 


and by (17) in evident notation 


(17') $1/\lambda_D(E_1, E_2) = \sum_{i=1}^m D_i(u) = D(u)$.


³ The last member of equation (18) has been inserted only for future reference.




4. Continuity of $\lambda$. If the sets $E_1$ and $E_2$ do not have the simple
structure assumed in section 3, we cannot in general find a harmonic function
in $D^*$ with preassigned boundary values and hence the extremal $\rho$-function,
if it exists in the general case, cannot be determined as above. To this end
we will prove the following lemma, which enables us to extend many of the
subsequent results on extremal distance to any compact sets.


LEMMA. Let $D$ be a region, and $E$ and $F$ two disjoint compact subsets
of $D$. Then if $\{E_n, F_n\}$ is a sequence of compact subsets of $D$, covering $E$
and $F$ respectively and converging to $E$ and $F$ (i.e. given $\epsilon > 0$ there exists
$N$ such that for $n > N$ every point of $E_n$ and $F_n$ is within distance $\epsilon$ of some
point of $E$ and $F$, respectively), we have

(20) $\lim_{n\to\infty} \lambda_D(E_n, F_n) = \lambda_D(E, F)$.

We consider the normalized definition (4'), and begin by proving⁴ that
for any given $\rho$, the condition $L_\rho(\gamma) \ge 1$ for all $\gamma$ joining $E$ and $F$ implies
that, given $\epsilon > 0$, for $n > n_0$,

(21) $L_\rho(\gamma') > 1 - \epsilon$

for all curves, denoted by $\gamma'$, joining $E_n$ and $F_n$. Let $z_0$ be any point of $E$,
let $C_r$ be the circle $|z - z_0| = r$ for $0 < r \le k$, where $k$ is such that $C_k \subset D$,
and $f(r) = \inf L_\rho(\gamma)$ for $\gamma$ joining $F$ and $C_r$. We wish to prove that

(22) $a = \lim_{r\to 0} f(r) \ge 1$.


From this (21) follows, since $E$ is compact, and the argument can be
reapplied to $F$.

Suppose $a < 1$. Since $\rho$ is square integrable on $D$ we see, by an application
of Schwarz’ inequality, that for any given $d > 0$ the set $S(d)$ of values
of $r$ for which

(23) $\int_{C_r} \rho\,|dz| < d$

has $r = 0$ as a point of accumulation. Hence we can select a sequence $\{r_m\}$,
$m = 1, 2, \dots$, decreasing to zero, $r_1 \le k$, such that


plasi< 


⁴ I am indebted to Professor Beurling for an unpublished communication containing
the argument that follows.




But, for 7 joining C,,,, and C,,, 
inf Lp (7) f (tn) — 


Hence a curve $\gamma$ can be constructed, joining $F$ and $z_0$, such that


Lely) +E (tan) +B olde] + 
$\le a + (1-a)/2 + (1-a)/4 = (a+3)/4 < 1$.


This contradiction proves (22). 


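To obtain (20) from (21) let us first assume that $\lambda(E, F)$ is finite.
Given $\epsilon$, $0 < \epsilon < \lambda(E, F)$, by (4') there is a $\rho$ for which

$1/A_\rho(D) > \lambda(E, F) - \epsilon$.

For this $\rho$, and $n > n_0$, by (21) there exist $E_n$, $F_n$ such that $L_\rho(\gamma') > 1 - \epsilon$
for all $\gamma'$. Hence the function $\bar\rho = \rho/(1-\epsilon)$ is one satisfying the normalization
(4') in evaluating $\lambda(E_n, F_n)$. We have

$\lambda(E_n, F_n) \ge 1/A_{\bar\rho}(D) = (1-\epsilon)^2/A_\rho(D) > (1-\epsilon)^2(\lambda(E, F) - \epsilon)$.

If $\lambda(E, F)$ is infinite, given $M > 2$, there is a $\rho$ with $1/A_\rho(D) > M$,
$L_\rho \ge 1$, and the analogous reasoning gives

$\lambda(E_n, F_n) \ge 1/A_\sigma(D) > (1 - 1/M)^2 M$, where $\sigma = \rho/(1 - 1/M)$.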


This completes the proof of (20). 


In the definition (4) of extremal distance we assumed the sets $E_1$ and
$E_2$ to be closed. This facilitates the statements and proofs of certain results,
but it should be pointed out that the restriction is unessential. The definition
(4) remains meaningful for arbitrary bounded sets $E_1$, $E_2$ with disjoint
closures, and the extremal distance thus defined is equal to the extremal
distance between the closures $\bar E_1$, $\bar E_2$.

The reasoning which led to the extension principle (6) immediately
gives us the inequality

(24) $\lambda(\bar E_1, \bar E_2) \le \lambda(E_1, E_2)$.


To prove the opposite inequality we use the normalized definition (4') and
hence wish to show that, if $\Gamma$ denotes the class of rectifiable curves $\gamma$ joining
$E_1$ and $E_2$, and $\bar\Gamma$ the class of rectifiable curves $\bar\gamma$ joining $\bar E_1$ and $\bar E_2$, then for
each fixed $\rho$,

(25) $\inf_{\gamma \in \Gamma} L_\rho(\gamma) \le \inf_{\bar\gamma \in \bar\Gamma} L_\rho(\bar\gamma)$.



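Let $\bar\gamma$ be any curve in $\bar\Gamma$, its endpoints $z \in \bar E_1$ and $\zeta \in \bar E_2$. For each
positive integer $n$, let $z_n \in E_1$ and $\zeta_n \in E_2$ be points such that $|z - z_n| < 2^{-n}$
and $|\zeta - \zeta_n| < 2^{-n}$. Denote by $\gamma_n$ the curve composed of the polygonal line
$z_n, z_{n+1}, \dots$, the curve $\bar\gamma$ and the polygonal line $\dots, \zeta_{n+1}, \zeta_n$. This
$\gamma_n$ belongs to $\Gamma$, and $\lim L_\rho(\gamma_n) = L_\rho(\bar\gamma)$ as $n \to \infty$, which proves (25); hence

(26) $\lambda(\bar E_1, \bar E_2) = \lambda(E_1, E_2)$.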


II. Representation of the Extremal Distance in Terms of a 
Generalized Potential. 


We will now derive a representation of the extremal distance,* which will in particular be useful for obtaining estimations.

Let the region $D$ and the compact subsets $E_1$ and $E_2$ be bounded by a finite number of analytic curves. Let $L$ be a straight line intersecting $D$, and denote by $\bar{z}$, $\bar{E}$, etc. the points and sets symmetric to $z$, $E$, etc. with respect to $L$. Consider the set $E = (E_1 \cap \bar{E}_1) \cup (E_2 \cap \bar{E}_2)$. If we identify symmetric boundary points of $E$, a finite number of Riemann surfaces are formed. We are going to apply to the set $D \cap C(E_1 \cup E_2)$, which is now contained in the Riemann surfaces, certain methods similar to those of logarithmic potential theory in the plane. The reader may in a first reading wish to follow the argument in the plane case and may do so by assuming $E$ to be empty or to be situated on the line $L$. For the applications in Chapter III, the general case is needed, however.

For simplicity we will denote any one of the Riemann surfaces constructed above by $D$, and by $E_1$, $E_2$ and $E$ the intersections of the original sets $E_1$, $E_2$ and $E$ with this $D$. By the existence theorem for abelian integrals of the third kind there is a function $G(\zeta, z_1, z_2)$ with the following properties: Given any two distinct points $z_1$, $z_2$ of $D$, the difference


(1)  $G(\zeta, z_1, z_2) - \log(|\zeta - z_2| / |\zeta - z_1|)$

is harmonic for $\zeta \in D$; when $\zeta \in E$, $G(\zeta, z_1, z_2) = G(\bar{\zeta}, z_1, z_2)$ and

$\partial G(\zeta, z_1, z_2)/\partial n = -\partial G(\bar{\zeta}, z_1, z_2)/\partial n$;


and $\partial G/\partial n = 0$ on the remaining boundary of $D$. Since $G$ is only determined up to a constant, we normalize it by requiring the difference (1) to vanish

* The possibility of such a representation was suggested to me by Professor Ahlfors.




594 VIDAR WOLONTIS. 


at an auxiliary point $\zeta = z_0$ on $E \cup L$. For later use we observe that the relations

(2)  $G(\zeta, z_1, z_2) + G(\zeta, z_2, z_1) = 0,$
(3)  $G(\zeta, z_1, z_2) + G(\zeta, z_2, z_3) + G(\zeta, z_3, z_1) = 0,$

hold for $z_i \in D$, $z_3 \ne z_1, z_2$. In fact, let $u(\zeta)$ denote for a moment the left member of either (2) or (3); $u$ is harmonic throughout $D$, $u(z_0) = 0$ and the Dirichlet integral $\int_{D'} u\,(\partial u/\partial n)\,|d\zeta|$ vanishes due to the properties of $G$ on the boundary $D'$ of $D$. Hence $u$ is identically zero. To see that $G(\zeta, z_1, z_2)$ is harmonic also in $z_1$ and $z_2$ we choose a point $z_4$ in $D$, distinct from $z_1$, $z_2$ and $z_3$, and conclude from


$\int_{D'} [G(\zeta, z_1, z_2)\,\partial G(\zeta, z_3, z_4)/\partial n - G(\zeta, z_3, z_4)\,\partial G(\zeta, z_1, z_2)/\partial n]\,|d\zeta| = 0$


that
(4)  $G(z_1, z_3, z_4) - G(z_2, z_3, z_4) - G(z_3, z_1, z_2) + G(z_4, z_1, z_2) = 0.$
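
As a hedged illustration (it is not part of the original argument), consider the simplest plane case mentioned earlier, under the assumption that $D$ is the entire plane and $E$ is empty. The difference (1) is then constant, and the normalization makes it zero, so the kernel is explicit and the relations (2), (3) and (4) can be checked by hand:

```latex
% Assumption for this sketch: D is the whole plane, E is empty,
% so the difference (1) vanishes identically.
\begin{align*}
G(\zeta, z_1, z_2) &= \log\frac{|\zeta - z_2|}{|\zeta - z_1|}, \\
% relation (2)
G(\zeta, z_1, z_2) + G(\zeta, z_2, z_1)
  &= \log\frac{|\zeta - z_2|}{|\zeta - z_1|} + \log\frac{|\zeta - z_1|}{|\zeta - z_2|} = 0, \\
% relation (3)
G(\zeta, z_1, z_2) + G(\zeta, z_2, z_3) + G(\zeta, z_3, z_1)
  &= \log\frac{|\zeta - z_2|\,|\zeta - z_3|\,|\zeta - z_1|}{|\zeta - z_1|\,|\zeta - z_2|\,|\zeta - z_3|} = 0, \\
% relation (4)
G(z_1, z_3, z_4) - G(z_2, z_3, z_4) - G(z_3, z_1, z_2) + G(z_4, z_1, z_2)
  &= \log\frac{|z_1 - z_4|\,|z_2 - z_3|\,|z_3 - z_1|\,|z_4 - z_2|}
              {|z_1 - z_3|\,|z_2 - z_4|\,|z_3 - z_2|\,|z_4 - z_1|} = 0.
\end{align*}
```

In the general case on the Riemann surfaces no such closed form is available, which is why the text argues through the Dirichlet integral and Green's formula instead.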


Let $M_i$ be the set of all Borel distributions $\mu_i$ on the boundary $E_i'$ of $E_i$ with $\mu_i(E_i') = 1$, $i = 1, 2$, i.e. measures for which every open set is measurable. For any $z \in D$, the abstract Lebesgue integral of $\log|\zeta - z|$ over $E_i'$ with respect to any such unit distribution is well defined. Thus we may consider the function

(5)  $p(z_1, z_2) = \int_{E_1'} G(\zeta, z_1, z_2)\,d\mu_1(\zeta) - \int_{E_2'} G(\zeta, z_1, z_2)\,d\mu_2(\zeta)$

for any $\mu_1 \in M_1$, $\mu_2 \in M_2$ and all values of $z_1$ and $z_2$ in $D$ except those for which both points fall simultaneously on $E_1'$ or simultaneously on $E_2'$. Differentiating under the integral signs we find $p(z_1, z_2)$ to be harmonic in both variables in $D \cap C(E_1' \cup E_2')$. When $z_1 \in E_1'$ or $z_2 \in E_2'$ or both, $p(z_1, z_2)$ is lower semi-continuous. To see this we may consider the truncated functions


$p_n(z_1, z_2) = \int_{E_1'} \min[G(\zeta, z_1, z_2), n]\,d\mu_1(\zeta) - \int_{E_2'} \max[G(\zeta, z_1, z_2), -n]\,d\mu_2(\zeta),$


which are continuous and increase to $p(z_1, z_2)$ (which may be $+\infty$) as $n \to \infty$. Analogously, if $z_1 \in E_2'$ or $z_2 \in E_1'$ or both, we find $p(z_1, z_2)$ to be upper semi-continuous. In particular we conclude that, for each fixed pair of unit distributions, the corresponding function $p(z_1, z_2)$ will attain a minimum for $z_1 \in E_1'$ and $z_2 \in E_2'$.

The main theorem of this chapter can now be expressed as follows: 


THEOREM. Let the region $D$, on a Riemann surface, and the disjoint compact subsets $E_1$ and $E_2$ of $D$ be bounded by a finite number of analytic curves. For each pair $\mu_1$, $\mu_2$ of Borel unit distributions on the boundaries $E_1'$, $E_2'$ of $E_1$, $E_2$, let the function $p(z_1, z_2)$ be defined by (5), where the kernel $G(\zeta, z_1, z_2)$ is defined by (1). Then there exists among these pairs of distributions a pair for which the quantity

$d = d(\mu_1, \mu_2) = \min_{z_i \in E_i'} p(z_1, z_2)$

is maximal, and this maximum is equal to $2\pi$ times the extremal distance between $E_1$ and $E_2$:


(6)  $\lambda_D(E_1, E_2) = \frac{1}{2\pi} \max_{\mu_i \in M_i} d(\mu_1, \mu_2) = \frac{1}{2\pi} \max_{\mu_i \in M_i} \min_{z_i \in E_i'} p(z_1, z_2).$
mie Mi mieMi 


To prove (6) we first wish to show that

(7)  $\lambda_D(E_1, E_2) \ge \frac{1}{2\pi}\,d(\mu_1, \mu_2)$


for all $\mu_i \in M_i$, and then construct the extremal distributions. Given any fixed pair $\mu_i \in M_i$, it is possible to fix $z_2$ on $E_2'$ so that $p(z, z_2)$ is still defined for $z \in E_2'$ and non-positive there. In fact, if $z_3$ is any fixed point in $D \cap C(E_2')$ we can, by the upper semi-continuity of $p$, choose $z_2$ to be a point of $E_2'$ at which $p(z, z_3)$ attains its maximum; since by (3) and (2)


(8)  $p(z, z_2) = p(z, z_3) - p(z_2, z_3),$


the desired non-positiveness follows. Given a positive number $\epsilon$ smaller than $d/2$, denote by $E_1^*$ the set where $p(z, z_2) \ge d - \epsilon$, and by $E_2^*$ the set where $p(z, z_2) \le \epsilon$. The set $E_1^*$ contains $E_1'$ by the definition of $d$, and $E_2^*$ contains $E_2'$ by our choice of $z_2$. By the lower semi-continuity of $p$ on $E_1'$ and the upper semi-continuity on $E_2'$, respectively, the boundaries $E_1^{*\prime}$ and $E_2^{*\prime}$ of $E_1^*$ and $E_2^*$ are disjoint from $E_1'$ and $E_2'$; hence, as level curves of a harmonic function, they are composed of a finite number of analytic curves. The function

(9)  $p^*(z) = (p(z, z_2) - \epsilon)/(d - 2\epsilon),$

harmonic in $D \cap C(E_1^* \cup E_2^*)$, has boundary values 0 on $E_2^{*\prime}$ and 1 on $E_1^{*\prime}$, and normal derivative 0 on the remaining boundary. By Ch. I, sec. 3, we conclude that


(10)  $\lambda(E_1^*, E_2^*) = 1/D(p^*) = (d - 2\epsilon)^2/D(p).$


But for any c, 0 < c¢ < d, we have, n denoting the inner normal, 




(11) D(p) ——4 ap/am| dz|——a | de| 


+d lal OG (Lo, 2, Z2) /On 
1\$1 G ly 2 
ante (£15 2, #2) | de | 
taf (£2) 0G 2, 22) | dz | 
p=c 


—— 4 | de | — Bad, 


where the change of the order of integration is justified by Fubini’s theorem, the next to the last step is a consequence of (4), and the last equality is verified separately for each of the topologically different mutual positions of the sets $E_1$, $E_2$ and the points $\zeta_1$, $\zeta_2$ on them. Substituting (11) into (10) we have, on account of the extension principle (6), Chapter I,

$\lambda(E_1, E_2) \ge \lambda(E_1^*, E_2^*) = (d - 2\epsilon)^2/2\pi d,$
hence,
(7)  $\lambda(E_1, E_2) \ge d/2\pi.$


To find distributions for which equality holds in (7), we consider the function $u(\zeta)$ which is harmonic on the open set $D^* = D \cap C(E_1 \cup E_2)$, takes the boundary values 0 on $E_2'$ and $\lambda(E_1, E_2)$ on $E_1'$, and has normal derivative zero on $D'$. Denoting by $n$ the inner normal, we define for all subsets $e$ of $E_1'$ and $E_2'$ respectively,


pa(e) f du/on || | 
(12) 
na(e) = du/an | 
By (17), Ch. I, for i=1,2, | 
f | du/on | | ——1/r fi u(du/an)| dg | —D(u)/r 
E;’ Ey’ 
= 


We will show that for the corresponding function p the relation 


(13) 22) 21, 22)0u(€)/dn | df | 




holds for all $z_1 \in E_1'$, $z_2 \in E_2'$, which in particular together with (7) will imply (6).

First let us assume $z_1$ and $z_2$ to be interior to $D^*$. If $C_i$ are circles in $D^*$ with centers $z_i$ and radii $r$, we apply Green’s formula in each component of $D^*$ and obtain by adding,


1) 


(14) 1/24 (Gdu/dn — udG/dn)| dg | =0 


D'y Ey’ Eo’ Civ Ce 


which for r— 0 takes the form 


(15) —U(Z2) = Gou/dn == 22). 


Ey’ E2! 


To extend the validity of (15) to the case where $z_i$ are on $E_i'$ and thus complete the proof of (13), we only have to observe that $p(z_1, z_2)$ is continuous in $z_1$ and $z_2$ on the closure of $D^*$. This follows from the fact that $p(z_1, z_2)$ differs by a continuous function from the ordinary logarithmic potentials of the given distributions with continuous densities.⁶


III. Estimations by Transportation and Projection. 


1. The symmetrization theorem for the entire plane. The original definition (4), Chapter I, includes the simplest lower estimation of $\lambda_D(E_1, E_2)$: If we choose a particular function $\rho(z)$, the value of the quantity (3) will not exceed $\lambda_D$.

The extension principle (6), Chapter I, yields both an upper and a lower estimation: We may construct within $E_1$ and $E_2$ sets whose extremal distance we are able to compute explicitly, and we may cover $E_1$ and $E_2$ by sets with this property.

In this chapter we shall show how the representation (6), Chapter II, enables us to find interesting upper estimations by deforming and moving the sets $E_i$, e.g. placing them in symmetric positions with respect to a given axis or projecting them upon a given curve. We commence with the following


THEOREM.⁷ Let $E_1$ and $E_2$ be two disjoint compact subsets of the $z$-plane, $D$. For every $r \ge 0$ denote by $C_r$ the circle $|z| = r$, and by $\alpha_1(r)$ and $\alpha_2(r)$


⁶ See O. D. Kellogg, Foundations of Potential Theory, 1929, Chapter VI.

⁷ A theorem of the same nature as this for another type of circular symmetrization is given, in the case where the sets are bounded by analytic curves, by G. Pólya, Comptes Rendus, Paris, 1950, t. 230, p. 25. Also compare G. Pólya and G. Szegő, American Journal of Mathematics, vol. 67 (1945), p. 1-32. Their proof is based on an entirely different idea.




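the angular Lebesgue measures of the sets $E_1 \cap C_r$ and $E_2 \cap C_r$, respectively. If the sets $\tilde{E}_1$ and $\tilde{E}_2$ are defined by the inequalities


(1)  $\tilde{E}_1$: $\pi - \tfrac{1}{2}\alpha_1(r) \le \phi(r) \le \tfrac{1}{2}\alpha_1(r) + \pi$,
     $\tilde{E}_2$: $-\tfrac{1}{2}\alpha_2(r) \le \phi(r) \le \tfrac{1}{2}\alpha_2(r)$,

$\phi$ and $r$ being polar coordinates in $D$, then ⁸


(2)  $\lambda_D(\tilde{E}_1, \tilde{E}_2) \ge \lambda_D(E_1, E_2).$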
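
The construction of the symmetrized sets in (1) can be made concrete on a polar grid. The following is a minimal sketch, not the paper's method: it assumes each set is given ring by ring as a list of equal-width boolean angular cells, and the names `angular_measure`, `symmetrize` and the sample data are hypothetical. For each ring it returns the single arc of equal angular measure centred at a chosen angle, $\pi$ for the first set and 0 for the second.

```python
import math


def angular_measure(ring):
    """Angular measure of one ring given as a list of equal-width boolean cells."""
    return 2.0 * math.pi * sum(ring) / len(ring)


def symmetrize(rings, center):
    """Replace each ring by one arc of the same angular measure centred at `center`.

    Returns one (lower, upper) angle pair per ring, following the pattern of (1):
    center - alpha/2 <= phi <= center + alpha/2.
    """
    arcs = []
    for ring in rings:
        alpha = angular_measure(ring)
        arcs.append((center - alpha / 2.0, center + alpha / 2.0))
    return arcs


# Hypothetical sample data: two rings of four cells each (cell width pi/2).
E1 = [[True, False, True, False], [False, True, False, False]]
E2 = [[False, True, False, False], [False, False, False, True]]

arcs_E1 = symmetrize(E1, math.pi)  # first set: centred on the negative real axis
arcs_E2 = symmetrize(E2, 0.0)      # second set: centred on the positive real axis
```

The sketch only produces the symmetrized arcs; it does not compute any extremal distance, so it illustrates the definition (1) and not the inequality (2).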


Our proof of this theorem rests essentially upon 


LEMMA 1. Let $E_1$ and $E_2$ be two disjoint compact sets with boundaries composed of a finite number of analytic curves. Let $L$ be a directed straight line through the origin; denote the half plane to the right of $L$ by $H$, the left by $\bar{H}$. If $A$ is any set in the plane, denote by $\bar{A}$ the set symmetric to $A$ with respect to $L$. Define the sets $E_1^*$ and $E_2^*$ as follows:


(3) 
Then
(4)  $\lambda_D(E_1, E_2) \le \lambda_D(E_1^*, E_2^*),$


where $D$ again is the entire plane.¹⁰


We begin by observing that the reasoning leading to the formula (18), 
Ch. I, remains valid in the case where the D; are the Riemann surfaces intro- 


⁸ It is not obvious that the sets $\tilde{E}_1$ and $\tilde{E}_2$ are closed. Those who wish, in defining extremal distance, to restrict themselves to closed sets, may prove the closedness as follows:

Consider a sequence $r_n e^{i\phi_n}$ of points of $\tilde{E}_2$, converging to a point $r_0 e^{i\phi_0}$. It follows from the definition of $\tilde{E}_2$ that the corresponding sequence of sets $E_2 \cap C_{r_n}$ has the property that $\liminf \alpha_2(r_n) \ge 2\phi_0$. We wish to prove that this implies $\alpha_2(r_0) \ge 2\phi_0$. Since $E_2$ is closed, $\alpha_2(r_0)$ is greater than or equal to the measure $\beta$ of the set of points on $E_2 \cap C_{r_0}$ which are limit points of sequences of points of the sets $E_2 \cap C_{r_n}$. But it is well-known (cf. E. J. McShane, Integration, p. 105) that $\beta \ge \limsup \alpha_2(r_n) \ge 2\phi_0$.

A similar remark applies to the theorems in section 2 of this chapter.

Incidentally, the closedness of the original sets $E_1$ and $E_2$ in the above theorem was used to assure the measurability of their intersections with the circles $C_r$.


⁹ In words: To obtain $E_1^*$ from $E_1$ we replace by their symmetric images in $H$ those parts of $E_1 \cap \bar{H}$ whose symmetric images do not belong to $E_1$. Similarly, we move the parts of $E_2 \cap H$ whose images do not belong to $E_2$.

¹⁰ The principle underlying this approach is related to the method of “rearrangement” discussed by Hardy, Littlewood and Pólya in their Inequalities, Cambridge, 1934.




duced at the beginning of Ch. II. Hence it is sufficient to prove Lemma 1 
for one such Riemann surface. As before, we will denote it by $D$, and the boundaries of $E_1 \cap D$ and $E_2 \cap D$ by $E_1'$ and $E_2'$, respectively.

Consider an arbitrary fixed pair of unit distributions on and 


Define a corresponding pair on as follows: 


yi*(A) =y,(A) for A C (Ey U (E/N #), 
= (4) for A C H) Ey), 


po*(A) = po(A) for A U (E/N B), 
po*(A) =p2(A) for A C H) BY). 


Similarly, if z, ¢ £,’, z2¢ H.’ is any pair of points on the original sets we 


define 
—= 2, for 2, (H,’ Ey’) U #), 
2,* == for 4, € M Hf) C (Ey Ey’), 
(6) 
Zo* = Zo for Z2€ E,’) U 
= %, for Nn £,’). 
Setting 


we wish to show that

(8)  $p(z_1, z_2) \le p^*(z_1^*, z_2^*).$


Since (6) sets up a one-to-one correspondence between the pairs of points on $E_1'$, $E_2'$ and $E_1^{*\prime}$, $E_2^{*\prime}$, (8) implies that


(9)  $\min p(z_1, z_2) \le \min p^*(z_1, z_2)$


for each pair of distributions; hence by (6), Ch. II, and (11), Ch. I,


We shall collect here a few properties of the function $G$, which we will need in the verification of (8): If $z_1$ and $z_2$ are interior points of $D \cap C(E)$ (we recall that $E = (E_1 \cap \bar{E}_1) \cup (E_2 \cap \bar{E}_2)$), the function


(11)  $G(\zeta, \bar{z}_1, \bar{z}_2) - G(\zeta, z_1, z_2)$


¹¹ In words: if the set $A$ moves in the transformation (3), the mass on $A$ moves with it.




is a harmonic function of $\zeta$ in $D$ except for positive poles at $\bar{z}_1$ and $z_2$ and negative poles at $z_1$ and $\bar{z}_2$, and vanishes on $E \cup L$. Hence it is non-positive if $\zeta$ and $z_1$ are on the same side of $L$ and $z_2$ on the opposite side, and non-negative if $\zeta$ and $z_2$ are on the same side of $L$ and $z_1$ on the opposite side. If $z_1$ is a point of $E$, the function (11) is singular only at $z_2$ and $\bar{z}_2$ (since on the Riemann surface the opposite poles at $z_1$ and $\bar{z}_1$ coincide), hence non-negative as soon as $\zeta$ and $z_2$ are on the same side of $L$ and non-positive if they are on opposite sides; and analogously for $z_2 \in E$. If both $z_1 \in E$ and $z_2 \in E$, the function (11) vanishes identically.

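If $z_1$ is an interior point and $z_2$ either an interior point or a point of $E$, the function
(12)  $G(\zeta, \bar{z}_1, z_2) - G(\zeta, z_1, z_2)$


is, by (3) and (2), Ch. II, equal to $G(\zeta, \bar{z}_1, z_1)$. On $E \cup L$,

$G(\zeta, \bar{z}_1, z_1) = G(\bar{\zeta}, \bar{z}_1, z_1) = G(\zeta, z_1, \bar{z}_1) = -G(\zeta, \bar{z}_1, z_1)$

(the middle equality follows from the fact that $G(\bar{\zeta}, \bar{z}_1, z_1) - G(\zeta, z_1, \bar{z}_1)$ is harmonic everywhere and vanishes at $z_0$), i.e. $G(\zeta, \bar{z}_1, z_1) = 0$. Hence the function (12) is non-negative when $\zeta$ is on the same side of $L$ as $\bar{z}_1$ and non-positive on the opposite side. If $z_1 \in E$, (12) is identically zero. Analogous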


considerations apply to the function 
(13) 415 22) — 

The verification of (8) must be carried out separately for the different possibilities arising from (6). First let us assume that $z_1^* = z_1$, $z_2^* = z_2$; then


Ey!’ 


G G(L, #2) duo(£) 
E;’nHoC(E£) oHoC(£) 


f G(£, 21, 22) dpo(£) 


HoC(E£) 


[G(z, Zi, Z2) — G(g, Z2) 


HaC(E) 


[G(z, 22) — G(E, 21, 22) 
HaC(£) 
This is seen to be non-negative by the properties of the function (11) discussed above; e.g. in the first integral on the right either $z_1 \in H \cap C(E)$,




$z_2 \in \bar{H} \cap C(E)$, in which case $\zeta$ and $z_2$ are on the same side and $z_1$ on the opposite side of $L$, or one of the points, e.g. $z_1$, is on $E$, in which case $\zeta$ and $z_2$ are on the same side of $L$, or both points are on $E$. Next suppose $z_1^* = \bar{z}_1$, $z_2^* = z_2$; then


(15) p* (2*, Z2*) p(%, Z2) E Z2) 415 Z2) 


H) y Ey’) 


[G(g, 22) — G(E, 21, 22) |due(€) 


(E2’ 9 H) y 


[G(%, 22) — 21, 22) = 0 
nH 


by the properties of (12) and (13), observing the equation 


The case $z_1^* = z_1$, $z_2^* = \bar{z}_2$ is analogous to (15). Finally suppose $z_1^* = \bar{z}_1$, $z_2^* = \bar{z}_2$; then


(16) p* (21*, 22*) — 22) = f 4, 22) — G(E, 1, 22) 


0 y 0 Ey’) 
(Eq! 9 H) y E92’) 
by (11), since 
Za, Z2) 415 Ze). 
The inequality (8) is verified, and lemma 1 proved. We proceed to 
Lemma 2. Let FE, and E, be two sets of the type defined by the following 


inequalities : 
(v—1)8 
1 
where ,” are numbers in the interval 0 = S 2x, and §>0 (1. e. sets com- 
posed by a finite number of “concentric rectangles” with altitude 8). Then 
E, and E, can, by a finite sequence of transformations each of which does not 
decrease the extremal distance, be transformed into their corresponding sym- 
metric images defined by (1). 
First we consider, for a fixed integer v, the subset H,” of EF, consisting 
of m, rectangles in the annulus (v—1)8<r=vé. We observe that in a 




transformation (3) in any line LZ through the origin the number my, of dis- 
joint rectangles remains the same. A point z of H,’* not in LF,” belongs to 
(LYN H) £,”); either it is connected with the set Ly’, 
in which case Z was connected with FL,” N £,” before the transformation, or it 
belongs to a rectangle F not connected with £,”/N £,’, in which case R was a dis- 
joint rectangle of L,”. The same reasoning applies to points of H,” not in L,”*. 
Now if and with 0S $1’ S gy’ S 3’ S oy’. 
are two rectangles of H,”, a transformation (3) in the axis L: $(¢1” + ¢3”) 
is seen to replace them by a rectangle of length $4,” — $3” + $2” — ¢,” and a 
line segment ¢ = ¢,”, (v—1)8Sr=vd. By Lemma 1 the extremal distance 
does not decrease. Now we remove the line segment; by (6), Ch. I, the 
extremal distance does not decrease. Repeating this process m,—1 times 
we have transformed /,” into a single rectangle with angular measure equal 
to that of Hy’. 

Having done this for all vy in both £, and F,, we wish to transform the 
sets obtained, say £,, F., into their symmetric images (1). Let R, be a 
rectangle of L;, the ¢-coordinate of its center ¢1, with —7r< ¢, <7. Then 
a transformation (3) in the axis L:$(¢:-+7) will place R, symmetrically 
with respect to the negative real axis. For R, CH, with center ¢», 
0 < $2 < 2a, a transformation (3) in the axis L: $2 places R, symmetrically 
with respect to the positive real axis. Using these transformations we can 
now move the components of /, and F2, one at a time, into their symmetric 
positions. The fact that, by the definition (3), once a rectangle has reached 
its symmetric position it remains there under any transformation (3) in an 
axis between 0 and z—and only such axes ZL are used in this paragraph— 
completes the proof of Lemma 2. 

The symmetrization theorem (2) can now be proved at once: Let the two disjoint compact sets $E_1$ and $E_2$ and positive numbers $\epsilon$ and $M$ be given. Let the integer $R$ be the radius of a circle with center at the origin, containing $E_1$ and $E_2$. For each integer $n$ we consider the set of closed rectangles formed by the circles and rays


(18) 
= p/2"R, p= 0, 


Denote by $E_1^n$ and $E_2^n$ the unions of those rectangles which contain points of $E_1$ and $E_2$, respectively. For $n$ large enough, $E_1^n$ and $E_2^n$ will be disjoint, and by (20), Ch. I, $n$ can be chosen so large that


(19)  $\lambda(E_1^n, E_2^n) > \lambda(E_1, E_2) - \epsilon,$




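if $\lambda(E_1, E_2)$ is finite, and


(19’)  $\lambda(E_1^n, E_2^n) > M$

if $\lambda(E_1, E_2)$ is infinite. The symmetric images $\tilde{E}_1^n$, $\tilde{E}_2^n$ (see (1)) of $E_1^n$ and $E_2^n$ contain $\tilde{E}_1$ and $\tilde{E}_2$, respectively. Hence, by (6), Ch. I, and Lemma 2,


(20)  $\lambda(\tilde{E}_1, \tilde{E}_2) \ge \lambda(\tilde{E}_1^n, \tilde{E}_2^n) \ge \lambda(E_1^n, E_2^n) > \lambda(E_1, E_2) - \epsilon,$


in the finite case, or


(20’)  $\lambda(\tilde{E}_1, \tilde{E}_2) > M$
in the infinite case, i.e.
(2)  $\lambda(\tilde{E}_1, \tilde{E}_2) \ge \lambda(E_1, E_2)$, q.e.d.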


The extension principle (6), Ch. I, combined with (2) yields the following


COROLLARY. Let $E_1''$ be the radial projection of $E_1$ upon the negative real axis: $-r \in E_1''$ if $C_r \cap E_1$ is not empty, and $E_2''$ the corresponding projection of $E_2$ upon the positive real axis: $r \in E_2''$ if $C_r \cap E_2$ is not empty. Then
(21)  $\lambda_D(E_1'', E_2'') \ge \lambda_D(\tilde{E}_1, \tilde{E}_2) \ge \lambda_D(E_1, E_2),$


where $D$ is the entire plane.


2. Similar results for other regions. To obtain results similar to the symmetrization theorem in cases where $D$ is not the entire plane, we may utilize a simple reflexion principle exemplified by the proofs of the following theorems.


THEOREM. Let $E_1$ and $E_2$ be two closed sets in the unit circle $D$, disjoint from each other and from the origin. If $\tilde{E}_1$ and $\tilde{E}_2$ denote the corresponding sets defined by (1), then

(22)  $\lambda_D(\tilde{E}_1, \tilde{E}_2) \ge \lambda_D(E_1, E_2).$


For the proof, let us first assume $E_1$ and $E_2$ to be bounded by a finite number of analytic curves. Let $\hat{E}_1$ and $\hat{E}_2$ be the images of $E_1$ and $E_2$ in a reflexion in the boundary $C$ of $D$. Let $u(z)$ be the harmonic function in $D \cap C(E_1 \cup E_2)$ taking the value 1 on $E_1$, 0 on $E_2$, and with normal derivative 0 on $C$. By (17), Ch. I, we have


(23)  $\lambda_D(E_1, E_2) = 1/D_D(u),$
where $D_D(u)$ denotes the Dirichlet integral of $u$ over $D \cap C(E_1 \cup E_2)$. In




the reflexion, $u(z)$ is extended into a function $\hat{u}(z)$ with boundary values 1 on $E_1 \cup \hat{E}_1$, 0 on $E_2 \cup \hat{E}_2$, and harmonic in the entire exterior of these sets. Denoting the entire plane by $D^*$, we then have

(24)  $\lambda_{D^*}(E_1 \cup \hat{E}_1, E_2 \cup \hat{E}_2) = 1/D_{D^*}(\hat{u}) = 1/2D_D(u) = \tfrac{1}{2}\lambda_D(E_1, E_2).$
Analogously,

(25)  $\lambda_{D^*}(\tilde{E}_1 \cup \hat{\tilde{E}}_1, \tilde{E}_2 \cup \hat{\tilde{E}}_2) = \tfrac{1}{2}\lambda_D(\tilde{E}_1, \tilde{E}_2).$


But the symmetrization theorem in $D^*$ applies to the left members of these equations; hence (22) follows for the quantities at right.

In the case where $E_1$ and $E_2$ are any disjoint closed sets not containing the origin, we cover them by rectangular sets and reason as in (20).


THEOREM. Let $E_1$ and $E_2$ be two disjoint compact subsets of the set $D$:
(26) 0< < < %, r>0, 


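For all $r > 0$ let $C_r$ denote the circle $|z| = r$, and $\alpha_1(r)$ and $\alpha_2(r)$ the angular Lebesgue measures of the sets $E_1 \cap C_r$ and $E_2 \cap C_r$, respectively. If the sets $E_1^*$ and $E_2^*$ are defined by the inequalities


(27)  $E_1^*$: $0 \le \phi(r) \le \alpha_1(r)$,
      $E_2^*$: $\phi_0 - \alpha_2(r) \le \phi(r) \le \phi_0$,

then

(28)  $\lambda_D(E_1, E_2) \le \lambda_D(E_1^*, E_2^*).$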


Let us first assume $E_1$ and $E_2$ to be bounded by a finite number of analytic curves. If $\phi_0 \ne \pi$, we begin the proof by mapping $D$ upon the upper half plane by the function $z'' = z^{\pi/\phi_0}$. By (5), Ch. I, the sets $E_1$, $E_2$, $E_1^*$, $E_2^*$ go into sets $E_1''$, $E_2''$, $E_1^{*\prime\prime}$, $E_2^{*\prime\prime}$ with the same mutual extremal distances. Now a reflexion in the real axis yields sets $E_1'' \cup \bar{E}_1''$, etc. to which the symmetrization theorem for the plane applies. The relations (24) and (25) are again valid; we obtain our theorem for the half plane, and by the inverse mapping $z = (z'')^{\phi_0/\pi}$ for our original region $D$. The extension to arbitrary compact sets is performed as before.

The case where D is a half circle or more generally a circular sector is 
reduced to the above by a reflexion in the circular part of the boundary. 

Extensions of the symmetrization theorem along another line are obtained 
simply by mapping one of the regions D of the above theorems conformally 
upon the region for which a symmetrization theorem is desired. Then, how- 
ever, the symmetrization will not in general remain radial and euclidean. 




3. Logarithmic capacity. Let $E$ be a compact set bounded by a finite number of regular curves. Green’s function $g(z)$ for the complement of $E$ is uniquely determined by the following requirements: It is continuous for all $z$, harmonic in $C(E)$, zero on $E$, and


(35)  $g(z) = \log|z| + c + \epsilon(|z|),$
where $\epsilon(|z|) \to 0$ as $|z| \to \infty$. The logarithmic capacity (transfinite diameter, outer radius) of $E$ is
(36)  $\mathrm{Cap}(E) = e^{-c}.$


In this section we will derive and apply a relation between logarithmic capacity and a modified form of extremal distance.

Let $z_0$ be a fixed point in the plane, and $C_R$ a circle with center $z_0$ and radius $R$, containing $E$ in its interior. Denoting by $\lambda(C_R, E)$ the extremal distance between $C_R$ and $E$ with respect to the interior of $C_R$, we define


(37)  $\lambda^*(E) = \lim_{R \to \infty} \{\lambda(C_R, E) - (1/2\pi)\log R\}.$

We need not stop here to verify that this limit exists and is finite and independent of $z_0$, since this will appear from the argument that follows. Consider the function ¹²


(38)  $\rho_0(z) = |\mathrm{grad}\ g(z)|.$


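For any curve $\gamma$ joining $E$ and $C_R$ we have


(39)  $\int_\gamma (\partial g/\partial s)\,|dz| = \log R + c + \epsilon(R)$


(independently of $z_0$, since $\log|z| = \log|z - z_0| + \epsilon(|z - z_0|)$). Further, if $D = (|z - z_0| < R) \cap C(E)$,


(40)  $\iint_D \rho_0^2\,dx\,dy = \int_{C_R} g\,(\partial g/\partial n)\,|dz| = \int_0^{2\pi} \{\log R + c + \epsilon(R)\}\{1/R + \epsilon(R)\}R\,d\phi = 2\pi\{\log R + c + \epsilon(R)\},$

again independently of $z_0$. Hence by the definition of extremal distance, (4) Ch. I,
(41)  $\lambda(C_R, E) \ge \frac{1}{2\pi}(\log R + c) + \epsilon(R).$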


¹² The following reasoning is similar to that of Chap. I, 3, and we may thus omit details. It follows closely the reasoning for the case of a definition analogous to (37) of a “reduced extremal distance” between $E$ and a finite point, appearing in unpublished work by Ahlfors and Beurling.




Now let $\rho$ be any member of the class $P$, normalized by $L_\rho(\gamma) = 1$. If $h$ denotes the conjugate function of $g$, we have


hence (compare the discussion of the level curves in Ch. I, 8) 


(43) f J 0/po dgdh = 2m. 
D 


Using this and (40) we find (since p,* is the Jacobian) 


(44) 0S (6/m—1/(log + 

i.e.
(45)  $\lambda(C_R, E) \le \frac{1}{2\pi}(\log R + c) + \epsilon(R).$
Combining (41), (45) and (37) we conclude that
(46)  $\lambda^*(E) = c/2\pi.$
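
As a check on (46), here is a worked example under the assumption that $E$ is the closed disc $|z| \le r_0$ and that $z_0 = 0$ (the radius $r_0$ is introduced only for this illustration). It uses the classical value $(1/2\pi)\log(R/r_0)$ for the extremal distance between the two boundary circles of an annulus, which is a standard fact and is not derived in this section:

```latex
% Assumptions for this sketch: E = { |z| <= r_0 }, z_0 = 0, R > r_0.
\begin{align*}
g(z) &= \log|z| - \log r_0, & c &= -\log r_0, & \operatorname{Cap}(E) &= e^{-c} = r_0, \\
\lambda(C_R, E) &= \frac{1}{2\pi}\log\frac{R}{r_0}, \\
\lambda^*(E) &= \lim_{R\to\infty}\Bigl\{\frac{1}{2\pi}\log\frac{R}{r_0} - \frac{1}{2\pi}\log R\Bigr\}
              = -\frac{1}{2\pi}\log r_0 = \frac{c}{2\pi}.
\end{align*}
```

In this example a larger disc has a larger capacity and a smaller $\lambda^*$, which is the direction of monotonicity that the relation between (36) and (46) requires.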


We now claim that $\lambda^*(E)$ does not decrease if $E$ is circularly symmetrized (see (1)) in any line $L$ and with any point $z_0$ as center. In view of the definition (36) this can be expressed as follows:


THEOREM.¹³ The logarithmic capacity of a compact set bounded by a finite number of regular curves does not increase under circular symmetrization.


We shall not take the time to remove the restriction on the boundary of $E$, but it should be observed that this restriction is used only to simplify the derivation of (46).

For the proof we choose in the theorem (1) $E$ as $E_1$ and the circle $C_R$ with center at the given point $z_0$ as $E_2$, and observe that in this case the extremal distance with respect to the entire plane is the same as that with respect to the interior of $E_2$. For each $R$, $\lambda(C_R, E)$ does not decrease in symmetrization, and $C_R$ is its own symmetric image. By the definition (37), our theorem follows.


UNIVERSITY OF KANSAS. 


¹³ No published proof of this result is known to the author. By comparing the paper by Pólya and Szegő with the indications in the note of Pólya, both referred to in the first footnote of this chapter, one concludes, however, that the result should be familiar to them. Compare also pp. 182-216 in G. Pólya and G. Szegő, “Isoperimetric inequalities in mathematical physics,” Annals of Mathematics Studies, no. 27, Princeton, 1951, where several results closely connected to the theorems in this chapter are found.




ON GEODESIC TORSIONS AND PARABOLIC AND ASYMPTOTIC 
CURVES.* 


By PHILIP HARTMAN and AUREL WINTNER.


1. Geodesic torsion. In the differential geometry of surfaces, the assumption that a surface has a parametrization of class $C^2$ seems to be a natural one. In fact, it is this assumption that permits the definition of both of the fundamental forms and of the standard curves on the surface (geodesics, asymptotic lines, lines of curvature). Assumptions of a higher degree of differentiability for the surface usually have therefore no geometrical significance. Cf. [6], [10]. In the light of this remark, various questions centering about the notion of “geodesic torsion” will be considered in what follows.

Let $X = (x, y, z)$ be a vector in a 3-dimensional Euclidean space and let, in a sufficiently small open domain in a $(u^1, u^2)$-plane,


(1)  $S$: $X = X(u^1, u^2)$

denote a (portion of a) surface of class $C^2$. By this is meant that $X(u^1, u^2)$ is a function of class $C^2$ and that the vector product $(X_1, X_2)$, where $X_i = \partial X/\partial u^i$, does not vanish. The unit vector


(2)  $N = N(u^1, u^2) = (X_1, X_2)/|(X_1, X_2)|$

is the normal vector on (1). The first and second fundamental forms of (1) are defined by
(3)  $ds^2 = dX \cdot dX = g_{ik}\,du^i du^k$ and $-dX \cdot dN = h_{ik}\,du^i du^k$,
respectively, that is, by
(4)  $g_{ik} = X_i \cdot X_k$ and $h_{ik} = -X_i \cdot N_k = X_{ik} \cdot N$,
where the dots denote scalar multiplications.

Corresponding to any point $(u^1, u^2)$ of $S$, let $(u^{1\prime}, u^{2\prime})$ represent any pair of numbers for which the vector $X'$ defined by $X_i(u^1, u^2)u^{i\prime}$ is of unit length. Define the vector $N'$ by $N_i(u^1, u^2)u^{i\prime}$, the scalar $\gamma$ by


(5)  $\gamma = \det(X', N, N'),$


and call $\gamma = \gamma(u^1, u^2; u^{1\prime}, u^{2\prime})$ geodesic torsion. If a curve $\Gamma$: $X = X(s)$


* Received October 25, 1951. 




608 PHILIP HARTMAN AND AUREL WINTNER. 


of class $C^1$ on the surface $S$ passes through the point $X$ of $S$ and has, at that point, the unit tangent vector $X' = dX(s)/ds$, then (5) will be called the geodesic torsion of $\Gamma$ at the point $X$.

The geodesic torsion of the point $X$ and the unit vector $X'$ is often defined to be the torsion (at the point $X$) of a geodesic through $X$ in the direction $X'$. Two objections can be raised to the latter definition. First, a geodesic on a surface of class $C^2$ is only of class $C^2$, while the standard definition of torsion is applicable only to arcs of class $C^3$. The second objection is that, even if the geodesic is very smooth, its torsion at a point is undefined if the curvature of the geodesic curve vanishes at that point.

The first criticism can be overcome if the word torsion of a curve (which need not be on a surface) is defined as in [6], pp. 770-772, where it was shown that torsion can be defined for certain arcs of class $C^2$ with non-vanishing curvature. In particular, if such an arc has a principal normal of class $C^1$, then the torsion can be defined geometrically and so as to satisfy the corresponding Frenet equations. As an application of that definition of torsion, suppose that $\Gamma$ is a geodesic of (1) and has a non-vanishing curvature (at a point, hence near that point). Then it has a principal normal, namely $\pm N$, and the latter is of class $C^1$ as a function of the arc length on $\Gamma$. Since the binormal of $\Gamma$ is the vector product $(X', \pm N)$, the Frenet equations, as used in [6] (Theorem VI, p. 772), imply the truth of the following assertion:


(I) On a surface of class $C^2$, a geodesic arc of non-vanishing curvature possesses a torsion, (5).


This disposes of the first of the two objections mentioned above. In contrast, the second of those objections cannot be overcome, except by some ad hoc definition of what the torsion of a curve should be when the curvature of the latter vanishes. In fact, even on a surface of class $C^\infty$, those points on a geodesic arc, at which the curvature of the latter vanishes, can form a nowhere dense perfect set. Needless to say, a geodesic (or any curve of class $C^2$ on a surface of class $C^2$) can have a vanishing curvature at a point only if it is in an asymptotic direction at that point; so that this is the only case excluded in (I).

Whenever the surface (1) is of class $C^2$, Weingarten’s derivation formulae, $N_i = -g^{jk}h_{ij}X_k$, where $(g^{ik})$ denotes the inverse of the matrix $(g_{ik})$, are applicable. Hence (5) can be written in the form


(6)  $\gamma = g\,(g^{2k}h_{ik}u^{i\prime}u^{1\prime} - g^{1k}h_{ik}u^{i\prime}u^{2\prime}),$
where $g = \det(X_1, X_2, N) = (\det g_{ik})^{1/2} > 0$.




GEODESIC TORSIONS AND PARABOLIC CURVES. 609 


In order to interpret (6), suppose, without loss of generality, the normalizations $g_{11} = g_{22} = 1$, $g_{12} = 0$ and $h_{12} = 0$ at the point $(u^1, u^2)$. Then $h_{11}$ and $h_{22}$ are the principal curvatures, say $\kappa_1$ and $\kappa_2$, and the directions determined by $(u^{1\prime}, u^{2\prime}) = (1, 0)$ or $(0, 1)$ are directions of principal curvature, associated with $\kappa_1$ or $\kappa_2$, respectively. If $(u^{1\prime}, u^{2\prime}) = (\cos\theta, \sin\theta)$, where $\theta$ is the angle from the direction $(1, 0)$ of principal curvature, associated with $\kappa_1$, to the direction $(u^{1\prime}, u^{2\prime})$, then (6) reduces to


(7)  $\gamma = (\kappa_2 - \kappa_1)\cos\theta\,\sin\theta,$


a formula which is standard (cf. e.g., [3], p. 389) under substantially more severe restrictions than the present assumptions.

It is clear from (7), and from the assumptions under which it was derived above, that the following statement is a corollary:


(II) If $\Gamma$ is a curve of class $C^1$ on a surface of class $C^2$, then $\gamma = 0$ holds at every point of $\Gamma$ if and only if $\Gamma$ is a line of curvature.
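
The reduction of the determinant (5) to the formula (7) can be checked numerically. The following is a minimal sketch under the normalization stated before (7): it assumes $X_1$, $X_2$, $N$ are the standard basis vectors at the point and that Weingarten's formulae reduce there to $N_1 = -\kappa_1 X_1$, $N_2 = -\kappa_2 X_2$. The function names `det3`, `geodesic_torsion` and `formula_7` are hypothetical and chosen only for this illustration.

```python
import math


def det3(a, b, c):
    """Determinant of the 3x3 matrix whose rows are the vectors a, b, c."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            - a[1] * (b[0] * c[2] - b[2] * c[0])
            + a[2] * (b[0] * c[1] - b[1] * c[0]))


def geodesic_torsion(k1, k2, theta):
    """Evaluate det(X', N, N') where g_ik is the identity and h_ik = diag(k1, k2)."""
    X1, X2, N = (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)
    # Weingarten in this normalization (assumption): N_1 = -k1 X_1, N_2 = -k2 X_2.
    N1 = tuple(-k1 * x for x in X1)
    N2 = tuple(-k2 * x for x in X2)
    u1, u2 = math.cos(theta), math.sin(theta)
    X_prime = tuple(u1 * a + u2 * b for a, b in zip(X1, X2))
    N_prime = tuple(u1 * a + u2 * b for a, b in zip(N1, N2))
    return det3(X_prime, N, N_prime)


def formula_7(k1, k2, theta):
    """Closed form (k2 - k1) cos(theta) sin(theta)."""
    return (k2 - k1) * math.cos(theta) * math.sin(theta)
```

For instance, `geodesic_torsion(2.0, -1.0, 0.3)` and `formula_7(2.0, -1.0, 0.3)` agree to rounding error, and both vanish at `theta = 0` and `theta = math.pi / 2`, the two principal directions.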


2. On the Beltrami-Enneper theorem. A point of (1) is called elliptic, hyperbolic or parabolic according as $\det h_{ik}$ (or, what according to (4) is the same thing, the Gaussian curvature $K = \det h_{ik}/\det g_{ik}$) is positive, negative or 0.

If $(u^1, u^2)$ is a non-elliptic point of (1), then a direction through the point is called asymptotic if


(8)  $h_{ik}u^{i\prime}u^{k\prime} = 0.$
In the normalization introduced before (7), condition (8) reduces to
(9)  $\kappa_1\cos^2\theta + \kappa_2\sin^2\theta = 0.$


On the other hand, since the product $\kappa_1\kappa_2$ is the Gaussian curvature, (7) is identical with


(10)  $\gamma^2 = -K$
if (9) is satisfied. In view of (7), this proves the following assertion:


(III) If $\Gamma$ is an arc of class $C^1$ on a surface of class $C^2$, then (10) holds at every point of $\Gamma$ if and only if every tangent vector of $\Gamma$ either is, or is orthogonal to, an asymptotic direction.
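
The passage from (7) and (9) to the relation $\gamma^2 = -K$ of (10) is short but is not written out. The following worked computation is a sketch of it, assuming the normalization introduced before (7), so that $K = \kappa_1\kappa_2$, and using $\gamma$, $\kappa_1$, $\kappa_2$, $\theta$ as in (7) and (9):

```latex
% By (9): \kappa_2\sin^2\theta = -\kappa_1\cos^2\theta
%    and  \kappa_1\cos^2\theta = -\kappa_2\sin^2\theta.
\begin{align*}
\gamma^2 &= (\kappa_2 - \kappa_1)^2\cos^2\theta\,\sin^2\theta \\
  &= (\kappa_2\sin^2\theta)(\kappa_2\cos^2\theta)
     + (\kappa_1\cos^2\theta)(\kappa_1\sin^2\theta)
     - 2\kappa_1\kappa_2\cos^2\theta\,\sin^2\theta \\
  &= -\kappa_1\kappa_2\cos^4\theta - \kappa_1\kappa_2\sin^4\theta
     - 2\kappa_1\kappa_2\cos^2\theta\,\sin^2\theta \\
  &= -\kappa_1\kappa_2\,(\cos^2\theta + \sin^2\theta)^2 = -K.
\end{align*}
```

Replacing $\theta$ by $\theta + \pi/2$ changes only the sign of (7), so $\gamma^2$ is the same for a direction and for the direction orthogonal to it, which is consistent with the "or is orthogonal to" clause of (III).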


The theorem of Beltrami-Enneper states that (under certain conditions 
of smoothness which usually are not, or are erroneously, specified; cf. [6], 
pp. 773) the relation (10) holds as an identity along asymptotic curves of 




610 PHILIP HARTMAN AND AUREL WINTNER. 


non-vanishing curvature. The first assertion of (III) contains this theorem,
and (III) avoids the difficulty involved in a statement about the torsion (not
the geodesic torsion) of an asymptotic curve; cf. (IV) below. This difficulty
arises when the asymptotic curve is only of class $C^1$ (or if, when it is smoother,
it possesses points at which its curvature vanishes). In this regard, cf. [6],
pp. 773-774.


3. On a result of P. Franklin. A curve $\Gamma$ on a surface of class $C^2$ will
be called a parabolic curve if every point of $\Gamma$ is a parabolic point, that is, if
$K = 0$ on $\Gamma$.

According to P. Franklin [4], pp. 254-256, a “regular” parabolic curve
must be a line of curvature. His definition of a “regular” curve $\Gamma$ on a
surface (1) consists of the following specifications: a) no point of $\Gamma$ is
singular, and b) at no point of $\Gamma$ does a normal section of the surface S
have a flex point; finally c) no point of $\Gamma$ is a flat point of S (that is, a
point at which both factors $\kappa_i$ of $K = \kappa_1\kappa_2$ vanish).

Franklin’s paper was reviewed by Cohn-Vossen and by Rinow. The
former [2] points out that Franklin’s proof is not valid, since it contains a
formal error, while the latter [9], without commenting on Franklin’s proof,
remarks that Franklin’s final assertion (see above) is not surprising, since
the set of the conditions a), b), c) which define “regularity” is quite severe.
In what follows, the actual situation resulting from both of these criticisms
will be clarified. On the one hand, it will be shown that if
condition a) is interpreted to mean that grad K (exists and is continuous
on S and) does not vanish on the parabolic curve $\Gamma$: $K = 0$, then Franklin’s
proof can be saved. On the other hand, it will be shown that condition b)
is so severe that the final assertion becomes practically vacuous. In fact, the
situation proves to be as follows:


(*) On a surface S of class $C^3$, let a parabolic curve $\Gamma$: $K = 0$ on S
be such that grad $K \neq 0$ on $\Gamma$, and suppose that $\Gamma$ satisfies Franklin’s condition
b). Then $\Gamma$ is of class $C^2$ and a plane curve; as a matter of fact, a plane curve
along which the tangent plane of S does not vary; in addition, $\Gamma$ is a line
of curvature as well as an asymptotic curve.


Actually, it cannot even be expected that a parabolic curve will “in
general” be a line of curvature. In fact, the notion of a parabolic curve is an
intrinsic one, depending only on the metric $(g_{ik})$, while the notion of a line
of curvature depends on the embedding of the metric into the 3-dimensional
Euclidean space (that is, on $(h_{ik})$ as well). The severity of Franklin’s
conditions is shown by the last italicized statement, a statement which implies
that the parabolic curve must become an asymptotic curve (and a line of
curvature). This situation is the more understandable as the asymptotic
curves are the characteristics of the partial differential equation of second
order on which the problem of embedding depends.


4. Asymptotic curves and parabolic curves. In view of (II) and (III),
and of the last italicized statement, (*), which remains to be proved, it seems
to be worth while to clarify the relationships between the following three
assumptions: ($\alpha$) $\Gamma$ is an asymptotic curve; ($\beta$) $\Gamma$ is a line of curvature;
($\gamma$) $\Gamma$ is a parabolic curve, where it is always assumed that $\Gamma$ is a curve of
class $C^2$ on a surface S of class $C^2$. It will be shown that (i) conditions ($\alpha$)
and ($\beta$) imply ($\gamma$) and that (ii) conditions ($\alpha$) and ($\gamma$) imply ($\beta$), but that
(iii) conditions ($\beta$) and ($\gamma$) do not imply ($\alpha$), while (iv) conditions ($\alpha$), ($\beta$),
($\gamma$) all are satisfied if and only if $\Gamma$ is a plane curve along which the tangent
plane to the surface S does not vary.¹

The assertions (i) and (ii) are essentially those of Cohn-Vossen [1],
pp. 274-275, who, however, involves the extraneous notion of an “envelope”
as well as the notion of torsion, of an asymptotic curve (and therefore, in
particular, heavy restrictions of differentiability).


Proof of (i) and (ii). Condition ($\alpha$) implies, by (III), that $\gamma = \pm(-K)^{1/2}$
on $\Gamma$. Hence, $\gamma = 0$ if and only if $K = 0$. It follows therefore from (II)
that an asymptotic curve is a line of curvature if and only if it is a parabolic
curve.


Proof of (iv). Clearly, condition ($\alpha$) is equivalent to the assumption of
the relation $X'\cdot N' = 0$ along $\Gamma$, which means that the tangent vector $X'$ is
orthogonal to $N'$ along $\Gamma$. On the other hand, the differential equations of
Rodrigues for lines of curvature, $N' + \kappa_i X' = 0$, where $\kappa_i$ is a principal
curvature and $i = 1, 2$, show that $X'$ is parallel to $N'$ along $\Gamma$ if and only if $\Gamma$
is a line of curvature. Thus $X'$ is orthogonal to, and at the same time parallel


¹ In his Introduction to Differential Geometry (Princeton, 1947), L. P. Eisenhart
claims that every plane asymptotic curve is a straight line (p. 249, Ex. 10). That this
theorem is false, or that a (smooth) curve $\Gamma$ satisfying the three conditions ($\alpha$), ($\beta$),
($\gamma$) need not be a straight line, is shown by the following example: Consider the curve
$(x, x^2, 0)$ on the surface S: $(x, y, z)$ belonging to $z = (y - x^2)^2$. Along the curve
$y = x^2$ of this surface, both $z_x$ and $z_y$ vanish identically. Hence the plane $z = 0$ contains
$\Gamma$ and is the tangent plane to S at every point of $\Gamma$. It follows therefore from (iv)
[and, of course, from an easy explicit calculation as well] that conditions ($\alpha$), ($\beta$), ($\gamma$)
are satisfied. But $\Gamma$ is not a straight line.




to, $N'$ if and only if ($\alpha$) and ($\beta$) hold. However, in this case, since $X' \neq 0$,
it follows that $N' = 0$ along the curve $\Gamma$. This means that the plane tangent
to the surface S is constant along the curve.


Proof of (iii). Conditions ($\beta$) and ($\gamma$) imply that both $\gamma = 0$ and (10)
hold along $\Gamma$. Hence (III) shows that $\Gamma$ is either an asymptotic curve or is
orthogonal to an asymptotic direction at every point of $\Gamma$. Consequently, (iii)
will be proved if it is shown that the second case of this alternative can
actually occur. But it is easy to verify that it does occur if the surface (1)
is chosen, for instance, as follows: $z = x^2 + y^3$, where $(x, y, z) = X$.


First, the partial derivatives $z_x$, $z_y$ and $z_{xx}$, $z_{xy}$, $z_{yy}$ of $z = x^2 + y^3$ are
$2x$, $3y^2$ and $2$, $0$, $6y$, respectively. Hence, from (2)-(4), where $u^1 = x$, $u^2 = y$,

(11) $g_{11} = 1 + 4x^2$, $g_{12} = 6xy^2$, $g_{22} = 1 + 9y^4$

and, if $+(1 + 4x^2 + 9y^4)^{1/2}$ is denoted by $j$,

(12) $h_{11} = 2/j$, $h_{12} = 0$, $h_{22} = 6y/j$.

Since $K = \det h_{ik}/\det g_{ik}$, it follows that

(13) $K = 12y/(1 + 4x^2 + 9y^4)^2$.


Consider the curve

(14) $\Gamma$: $z = x^2$, $y = 0$.

This curve is on the surface S: $z = x^2 + y^3$. On the other hand, (12) shows
that (8), where $(u^1, u^2) = (x, y)$, reduces to $2\,dx^2 + 6y\,dy^2 = 0$, a differential
equation for $y = y(x)$ which is not satisfied along the curve (14). Hence
(14) is not an asymptotic curve. But it is a parabolic curve, since (13) and
(14) imply that $K = 0$. Thus all that remains to be ascertained is that (14)
is a line of curvature. This follows, however, by observing that, according to
(11) and (12), both $g_{12}$ and $h_{12}$ vanish identically along the coordinate axes,
$x = 0$ and $y = 0$, which means that the latter are lines of curvature. Hence
the assertion follows from the second of the equations (14).

Condition grad $K \neq 0$ of (*) is satisfied in this example, since the partial
derivative of (13) with respect to $y$ is $12/(1 + 4x^2)^2$, hence distinct from 0,
along the curve (14).
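
For the example $z = x^2 + y^3$, the curvature and its derivative along $y = 0$ can be obtained from the standard formula for the Gaussian curvature of a graph $z = z(x, y)$. The lines below are only a sketch of that computation; the symbol $W$ for $1 + z_x^2 + z_y^2$ is introduced here for brevity and is not used in the text.

```latex
\begin{align*}
z &= x^2 + y^3, \qquad z_x = 2x,\quad z_y = 3y^2,\quad
     z_{xx} = 2,\quad z_{xy} = 0,\quad z_{yy} = 6y, \\
W &= 1 + z_x^2 + z_y^2 = 1 + 4x^2 + 9y^4
   = (1 + 4x^2)(1 + 9y^4) - (6xy^2)^2, \\
K &= \frac{z_{xx}z_{yy} - z_{xy}^2}{W^2}
   = \frac{12y}{(1 + 4x^2 + 9y^4)^2}, \\
\left.\frac{\partial K}{\partial y}\right|_{y=0}
  &= \frac{12}{(1 + 4x^2)^2} \neq 0 .
\end{align*}
```

Thus $K$ vanishes exactly on $y = 0$, while grad $K$ does not vanish there.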


5. Proof of (*). Consider the surface (1) in a Cartesian parametric form
S: $z = z(x, y)$, where $z(x, y)$ is a function of class $C^3$ in a vicinity of
$(x, y) = (0, 0)$. It can be supposed that the coordinate axes have been



chosen so that $(x, y) = (0, 0)$ corresponds to a given point of $\Gamma$, and that
the unit normal vector at this point is directed along the z-axis. Thus

(15) $z(0, 0) = 0$ and $z_x(0, 0) = 0$, $z_y(0, 0) = 0$.

Suppose that the Gaussian curvature, $K = K(x, y)$, vanishes at (0, 0). Then,
since

(16) $K = (z_{xx}z_{yy} - z_{xy}^2)/(1 + z_x^2 + z_y^2)^2$

for every $(x, y)$, and since $K(0, 0) = 0$, it can be supposed, after a rotation
of the $(x, y)$-plane about the origin, that

(17) $z_{xy}(0, 0) = 0$ and $z_{yy}(0, 0) = 0$.

The last three formula lines show that

(18) $K_x(0, 0) = z_{xx}(0, 0)\,z_{yyx}(0, 0)$ and $K_y(0, 0) = z_{xx}(0, 0)\,z_{yyy}(0, 0)$.

Hence, $|\mathrm{grad}\,K|^2 = K_x^2 + K_y^2$ will not vanish at (0, 0), and therefore at
any $(x, y)$ in a vicinity of (0, 0), if and only if

(19) $z_{xx}(0, 0) \neq 0$

holds and not both $z_{yyx}(0, 0)$ and $z_{yyy}(0, 0)$ vanish. But if assumption b) of
(*) is satisfied, then $z_{yyx}(0, 0)$ cannot vanish, since

(20) $z_{yyy}(0, 0) = 0$.

In fact, if (20) did not hold, then, since (15) and (17) imply that
$z(0, y) = z_{yyy}(0, 0)\,y^3/6 + o(|y|^3)$, it would follow that the normal section
$x = 0$ of S: $z = z(x, y)$ has a flex point at $y = 0$.

The equation $K(x, y) = 0$, defining a parabolic curve $\Gamma$, and the condition
that not both $K_x$ and $K_y$ vanish at a point, say (0, 0), of $\Gamma$ show that
$\Gamma$ is a curve of class $C^1$ and satisfies the non-singular differential equation
$K_x\,dx + K_y\,dy = 0$. According to (18), (19) and (20), this differential
equation reduces to $dx/dy = 0$ at $(x, y) = (0, 0)$. But (17) and (19) imply
that $dx/dy = 0$ defines the (unique) asymptotic direction at (0, 0). Hence,
the direction of $\Gamma$ at (0, 0) is an asymptotic direction. Since (0, 0) represents
an arbitrary point of $\Gamma$, it follows that $\Gamma$ is an asymptotic curve. Hence $\Gamma$ is of
class $C^2$ and (*) follows from (i), (ii) and (iii).
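
The values of $K_x$ and $K_y$ at the origin used in this section follow from (16) by a routine differentiation. A sketch is given below; the abbreviations $D$ and $W$ for the numerator of (16) and for $1 + z_x^2 + z_y^2$ are introduced here for convenience only.

```latex
\begin{align*}
K &= D\,W^{-2}, \qquad D = z_{xx}z_{yy} - z_{xy}^2, \qquad W = 1 + z_x^2 + z_y^2, \\
K_x &= D_x\,W^{-2} - 2D\,W^{-3}\,W_x, \qquad
D_x = z_{xxx}z_{yy} + z_{xx}z_{yyx} - 2z_{xy}z_{xyx}, \\
K_y &= D_y\,W^{-2} - 2D\,W^{-3}\,W_y, \qquad
D_y = z_{xxy}z_{yy} + z_{xx}z_{yyy} - 2z_{xy}z_{xyy}, \\
\text{at } (0,0):\quad W &= 1,\quad D = 0,\quad z_{xy} = z_{yy} = 0, \\
K_x(0,0) &= z_{xx}(0,0)\,z_{yyx}(0,0), \qquad
K_y(0,0) = z_{xx}(0,0)\,z_{yyy}(0,0) .
\end{align*}
```

Here $W(0,0) = 1$ comes from the vanishing of $z_x$, $z_y$ at the origin, and $D(0,0) = 0$ from $K(0,0) = 0$.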


6. On a theorem of van Kampen. Using a fallacious generalization of
the Beltrami-Enneper formula, $\tau^2 = -K$, for the torsion, $\tau$, of an asymptotic
curve, van Kampen [8] has arrived at the following result:

(†) Let S be a surface of class $C^2$, and P a hyperbolic point ($K < 0$)
of S. Let J denote one branch of the intersection of S and of the plane



tangent to S at P, and suppose that at every point distinct from P the
plane curve J has a non-vanishing curvature. Then there exists on S at least
one asymptotic curve, say $\Gamma$, which passes through P in such a way that J is
tangent to $\Gamma$ at P and lies between $\Gamma$ and the common tangent; that is, J lies
between $\Gamma$ and the normal section of S determined by the common direction
of J and $\Gamma$ at P.


That van Kampen’s generalization of the Beltrami-Enneper formula is
false, is seen by comparing it (Theorem (I) in [8]) with Bonnet’s formula
(cf. (44) below), which connects the curvature and the torsion of a curve
drawn through a point of a surface in an asymptotic direction. The error
is made when, in the last sentence of the second paragraph of [8], p. 992,
van Kampen assumes that he can differentiate a relation (along a curve),
whereas the relation in question is valid only at one point of that curve.

It turns out, however, that van Kampen’s final result, represented by the
last italicized statement, (†), happens to be correct. This will be proved in
what follows.


Remark. It will remain undecided whether, in the above wording of (†),
the passage “at least one asymptotic curve” can be replaced by “all asymptotic
curves” or, for that matter, by “the asymptotic curve.” In this regard,
cf. the example, given in [7], pp. 153-156, of a surface S of class $C^2$ having
a negative curvature K and containing a point P from which there issues more
than one asymptotic curve (hence a continuum of asymptotic curves) in the same
asymptotic direction. In that particular example, there exist asymptotic
curves on different sides of their (common) tangent at P but, in contrast to
the assumption in (†) above, the curvature of the curve of intersection, J,
vanishes at a sequence of points which cluster at P. Hence it remains a
question whether or not the non-vanishing of the curvature of J (at those
points of J which are distinct from P) implies that all asymptotic curves
(touching J at P) are separated by J from the tangent of J at P or, for that
matter, that the asymptotic curve (through P and in the direction of J)
must be unique.


It may be mentioned that (†) could also be deduced from Beltrami’s
formula (cf. (50) below), if S is of class $C^3$ and J has a non-vanishing
curvature.


7. Proof of (†). It can be assumed that S is given in the form
$z = z(x, y)$, where $z(x, y)$ is a function of class $C^2$; that P is at $(x, y) = (0, 0)$
and that (15) is satisfied; finally that, since the value of (16) at P is




supposed to be negative, $z_{xx}(0, 0) = z_{yy}(0, 0) = 0$, while $z_{xy}(0, 0)$ is positive.
If (by the choice of the units of length along coordinate axes) this positive
number is chosen to be 1, it follows that

(21) $z = xy + o(x^2 + y^2)$,

where the o-term represents a function of class $C^2$ in $(x, y)$. The form on
the left of (8) reduces at P to $2x'y'$, a form which represents positive values
for directions corresponding to the first and third quadrants, ($x' > 0$, $y' > 0$)
and ($x' < 0$, $y' < 0$), and negative values for the second and fourth quadrants.
In particular, the asymptotic directions through P are the directions of the
coordinate axes.

Consider that branch, J, of the intersection of S and of the tangent
plane at P (i.e., of the plane $z = 0$) which is tangent to the x-axis. Then,
by Lemma (§) of Section 8 below, J is a curve which (for small $|x|$) can be
represented in the form

(22) J: $y = y(x)$,

where $y(x)$ is a function of class $C^1$ for all $x$, and of class $C^2$ for all $x \neq 0$;
cf. the assumption of (†) concerning the non-vanishing of the curvature
of J at points distinct from P.

According to (21), and since (22) is a curve passing through P = (0, 0),
the partial derivative $z_y(x, y)$ for small positive $x$ is $x + o(x)$ along the
curve J. Hence, if $x$ in (22) is chosen to be positive, then $z_y(x, y(x))$ is
positive (for small $x > 0$). On the other hand, since the curve (22) is on
the surface $z = z(x, y)$ and on the plane $z = 0$,

(23) $z(x, y(x)) = 0$,

and differentiation of (23) with respect to $x$ gives

(24) $z_x(x, y(x)) + z_y(x, y(x))\,y'(x) = 0$

(even if $x = 0$), whence one more differentiation leads (if $x \neq 0$) to

(25) $z_{xx} + 2z_{xy}y' + z_{yy}y'^2 + z_y y'' = 0$, where $y = y(x)$.

Since the coefficient, $z_y$, of $y''$ in (25) was just seen to be positive (for small
positive $x$), it follows that the sum of the first three terms of (25) is positive
or negative according as $y''(x)$ is negative or positive for small positive $x$
(in fact, $y''(x) = 0$ is excluded by the assumption of a non-vanishing curvature
at the points of (22) distinct from its point belonging to $x = 0$).
Suppose, for instance, that $y''(x) > 0$ (for small positive $x$). Then
$[y(x)] < 0$, if $[y(x)]$ denotes the sum of the first three terms of (25).
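
The two differentiations leading from (23) to (25) are applications of the chain rule; they can be written out as follows (a sketch, with all partial derivatives of $z$ evaluated at $(x, y(x))$ and $x \neq 0$ in the last two lines).

```latex
\begin{align*}
0 &= z(x, y(x)), \\
0 &= \frac{d}{dx}\, z(x, y(x)) = z_x + z_y\, y', \\
0 &= \frac{d}{dx}\,\bigl(z_x + z_y\, y'\bigr)
   = (z_{xx} + z_{xy}\, y') + (z_{xy} + z_{yy}\, y')\, y' + z_y\, y'' \\
  &= z_{xx} + 2 z_{xy}\, y' + z_{yy}\, y'^2 + z_y\, y'' .
\end{align*}
```

Hence the sum of the first three terms equals $-z_y\,y''$, so that it has the sign opposite to that of $y''$ whenever $z_y > 0$.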




On the other hand, (8) shows that if $y = y^*(x)$, $z = z(x, y^*(x))$ is an
asymptotic curve, then $[y^*(x)] = 0$. It follows therefore from the signature
rule of the four quadrants (described, after (21), with regard to the second
fundamental form of the surface at the point P), that, for small positive $x$,
the slope of the relevant asymptotic direction at the point $(x, y(x))$ of the
surface is greater than the slope of the curve (22) at the corresponding $x$.
In fact, this is clear for reasons of continuity, since the asymptotic directions
at the point $(x, y(x))$ of the surface are close to those at the point P = (0, 0),
while the latter directions are those of the coordinate axes, $x = 0$ and $y = 0$.
Accordingly, if $a$ is any point of an interval $0 < a \leq a_0$, where $a_0 > 0$ is
sufficiently small, then there is an asymptotic curve (at least one), say

(26) $\Gamma_a$: $y = y(x; a)$,

passing through the point $(a, y(a))$ of the curve (22) in a direction nearly
tangent to the direction of the curve (22) at $x = a$, and all functions (26) of
$x$ will exist for $0 \leq x \leq a_0$ (if $0 < a \leq a_0$, and if $a_0$ is sufficiently small), finally
all these curves (26) will satisfy, with reference to the curve (22), the
inequality

(27) $y(x; a) > y(x)$ if $a < x \leq a_0$.


Since the surface S is hyperbolic at (hence near) the point P = (0, 0),
its asymptotic curves (26) will satisfy, for $0 \leq x \leq a_0$, a differential equation
(8), which is of the form

(28) $y' = f(x, y)$,

where $f(x, y)$ is continuous (for small $x^2 + y^2$). Hence the standard arguments,
which deal with (28) on the basis of equicontinuous functions, show that
there exists a sequence of positive numbers $a_1, a_2, \ldots$ satisfying $a_0 > a_n \to 0$,
as $n \to \infty$, and having the property that $y(x; a_n)$, the function (26) belonging
to $a = a_n$, tends to a limit function, uniformly for $0 \leq x \leq a_0$, as $n \to \infty$.
Furthermore, this limit function $y = y^*(x)$ is (as is each of the curves (26))
the graph of the projection on the $(x, y)$-plane of an asymptotic curve.
Finally, it follows from (27) that

(29) $y^*(x) \geq y(x)$ for $0 \leq x \leq a_0$.


Clearly, (29) completes the proof of the last italicized statement, (†),
if the lemma (§) of Section 8 is granted. In fact, while (29) was deduced
for the case in which $y''(x) > 0$ for small $x > 0$, the case in which $y''(x) < 0$
for small $x > 0$ (as well as both cases of a small $-x > 0$) can, of course,
be treated in the same way.




8. On the “true” Dupin diagram. Let T be a plane through a hyperbolic
point P of a surface S of class $C^2$. If T is not the plane tangent to S
at P, and if S is a sufficiently small neighborhood of P, then the set ST,
along which T intersects S, consists of a Jordan arc of class $C^2$. This is an
immediate consequence of (the $C^2$-form of) the classical theorem on implicit
functions, the exemption of the tangent plane being equivalent to the
non-vanishing of the gradient involved. Correspondingly, if T is the tangent
plane, then, since the gradient vanishes at P, no general theorem on implicit
functions is applicable to either of the branches of which, in view of Dupin’s
indicatrix, the set ST can be expected to consist. That the essential (but,
perhaps against expectation, not all) aspects of what is indicated by Dupin’s
approximation are nevertheless true, is the content of the first two assertions
of the following lemma (the third assertion of which shows that, because of
the vanishing of the gradient, the $C^2$-character of the “implicit functions”
can actually be lost).


(§) If S is a sufficiently small neighborhood of a hyperbolic point P
on a surface of class $C^2$, and if T denotes the plane tangent to S at P, then

(i) the intersection ST consists of two Jordan arcs, say $J_1$ and $J_2$,
each of which contains P in its interior, and P is the only point common to
$J_1$ and $J_2$;

(ii) both plane curves $J_1$, $J_2$ have continuous tangents (also at P) and,
except possibly at P, continuous curvatures as well;

(iii) the curvature of $J_i$ at P need not exist (and, if it exists at P,
it need not be continuous at P), unless an assumption going beyond the
$C^2$-assumption (such as the $C^3$-assumption) is required of S.


Assertion (i) and (at least the first part of) assertion (ii) are closely 
connected with the results of Hadamard [5] on the invariant curves of a 
surface transformation near a fixed point of hyperbolic type. It is, however, 
more convenient to prove (i) and (ii) directly, and in a way which, in con- 
trast to Hadamard’s procedure, does not depend on the method of successive 
approximations. 

As at the beginning of Section 7, it can be assumed that S is given in
the form $z = z(x, y)$, and that T is the $(x, y)$-plane and P is its point (0, 0),
but that (21) is replaced by

(30) S: $z = \tfrac{1}{2}(y^2 - x^2) + o(x^2 + y^2)$,

where the o-term represents a function which is of class $C^2$ in $(x, y)$ (in fact,
(30) differs from (21) in a rotation about P). According to (30), the




asymptotic directions of S at P, represented by the asymptotes of Dupin’s
hyperbola at P, are the bisectors ($x \pm y = 0$) of the coordinate quadrants.
Since T is the plane $z = 0$, its intersection with S is the set of points
$(x, y)$ satisfying

(31) ST: $\tfrac{1}{2}(y^2 - x^2) + o(x^2 + y^2) = 0$.

It is clear from (31) that, if S, or the $(x, y)$-domain under consideration,
is chosen small enough, then every point of ST is contained in one of the
four wedges (issuing from (0, 0) and bisected by the four asymptotic
half-lines) which are defined by the inequalities $\tfrac{1}{2}|x| \leq |y| \leq 2|x|$.

Consider the wedge contained in the first quadrant of the $(x, y)$-plane,
that is, the wedge

(32) $\tfrac{1}{2}x \leq y \leq 2x$ (so that $x > 0$ and $y > 0$ unless $x = y = 0$).

It will be shown that those points $(x, y)$ of ST which are contained in this
wedge form (for sufficiently small $x \geq 0$) a Jordan arc which is representable
in the form $y = y(x)$, where $y(x)$ is a (single-valued) function having a
continuous first derivative $y'(x)$; that $y'(0)$ (when interpreted as the derivative
at $x = 0$ from the right) is 1; finally that the function $y(x)$ has a continuous
second derivative $y''(x)$ if $x > 0$. Since (32) could be replaced by
any of the four wedges, this will prove assertions (i) and (ii) of the last
italicized assertion.


Proof of (i)-(ii). It is clear from (30) that, if $x$ is positive (and small
enough), then $z(x, \tfrac{1}{2}x)$ and $z(x, 2x)$ are of opposite sign. Hence $z(x, y)$
must vanish at least once, say at the ordinate $y = y(x)$, when $x > 0$ is fixed
and $y$ varies from the lower to the upper half-line bordering the wedge (32).
On the other hand, since the o-term in (30) is a function of class $C^2$ in $(x, y)$,
it is clear from (30) that $z_y(x, y) = y + o((x^2 + y^2)^{1/2})$. Hence, if $x > 0$ is
small enough, $z_y(x, y)$ is positive within the wedge. Consequently, the ordinate
$y = y(x)$, mentioned before, is unique. It now follows by standard arguments,
occurring in the proof of the classical theorems on implicit functions,
that the function $y(x)$ is continuous for $x \geq 0$ and that it has a continuous
second derivative for $x > 0$; cf. (24) and (25). That it has a continuous
first derivative at $x = 0$ also, follows from (24). In fact, (30) shows that
(24) can be written in the form

$-x + o(x) + (y(x) + o(x))\,y'(x) = 0$

for small positive $x$, and (31) shows that $y(x)/x = 1 + o(1)$.
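
How these two relations give $y'(0) = 1$ and the continuity of $y'$ at $x = 0$ can be spelled out as follows. This is only a sketch; it assumes that the displayed form of (24) reads $-x + o(x) + (y(x) + o(x))\,y'(x) = 0$, which is what $z_x = -x + o(x)$ and $z_y = y + o(x)$ within the wedge suggest.

```latex
\begin{align*}
y'(x) &= \frac{x + o(x)}{y(x) + o(x)}
       = \frac{1 + o(1)}{y(x)/x + o(1)}
       \longrightarrow 1 \qquad (x \to +0), \\
y'(0) &= \lim_{x \to +0} \frac{y(x) - y(0)}{x}
       = \lim_{x \to +0} \frac{y(x)}{x} = 1 .
\end{align*}
```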


Proof of (iii). In a neighborhood of $x = 0$, let $f(x)$ be any function




satisfying the following pair of conditions: (a) there exists a continuous
second derivative $f''(x)$ (also at $x = 0$); (b) if $x \to 0$, then $f(x) = o(x^2)$.
In terms of such an $f(x)$, define S by $z = z(x, y)$, where $z(x, y) = xy - f(x)$.
Then S is of class $C^2$, by (a), and (21) is satisfied, by (b). Furthermore,
it is seen that (23), the equation defining the intersection ST, splits into
$J_1$: $x = 0$ and $J_2$: $y = f(x)/x$. Hence, in order to conclude the truth of
(iii), it is sufficient to observe that there exist two functions, say $f(x) = g(x)$
and $f(x) = h(x)$, which satisfy both (a) and (b) and have the property that
the function defined by $y(x) = f(x)/x$ (if $x \neq 0$, and by $y(0) = 0$ if $x = 0$)
has at $x = 0$ no second derivative or a discontinuous second derivative according
as $f = g$ or $f = h$.
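
The text does not exhibit the functions $g$ and $h$. One possible choice, offered here only as an illustration and not taken from the source, is the following; both functions are understood to take the value 0 at $x = 0$.

```latex
\begin{gather*}
g(x) = |x|^{5/2}, \qquad g''(x) = \tfrac{15}{4}\,|x|^{1/2}, \qquad g(x) = o(x^2), \\
y(x) = \frac{g(x)}{x} = \operatorname{sgn}(x)\,|x|^{3/2}, \qquad
y'(x) = \tfrac{3}{2}\,|x|^{1/2}, \qquad
\frac{y'(x) - y'(0)}{x} = \frac{3}{2}\,\frac{|x|^{1/2}}{x}
  \ \text{ is unbounded as } x \to 0; \\[1ex]
h(x) = x^5 \sin(1/x), \qquad
h''(x) = 20x^3 \sin(1/x) - 8x^2 \cos(1/x) - x \sin(1/x), \qquad h(x) = o(x^2), \\
y(x) = \frac{h(x)}{x} = x^4 \sin(1/x), \qquad y''(0) = 0, \qquad
y''(x) = 12x^2 \sin(1/x) - 6x \cos(1/x) - \sin(1/x) \quad (x \neq 0).
\end{gather*}
```

In the first case $y''(0)$ does not exist; in the second case $y''(0)$ exists but $y''(x)$ has no limit as $x \to 0$, because of the term $\sin(1/x)$.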


9. Geodesic curvature and geodesic torsion. After the italicized assertion,
(†), of Section 6, reference was made to a formula due to Bonnet (cf. [3],
pp. 397-399). Inasmuch as this formula, which is (44) below, is usually
derived in a somewhat roundabout way and without a specification of the
assumptions on which it depends, it will be proved in what follows by a more
direct approach, leading to a reasonable minimum of the conditions to be
required for its validity.

Let S be a surface of class $C^2$, and $\Gamma$: $X = X(s)$ a curve of class $C^2$
on S, where s denotes the arc length on $\Gamma$. Define on $\Gamma$ three, mutually
perpendicular, unit vectors $V_1$, $V_2$, $V_3$ (which form a right-hand orthogonal
system), by placing

(33) $V_1(s) = X'(s)$, $V_2(s) = N(s)$, $V_3(s) = (X'(s), N(s))$,

where the prime denotes differentiation with respect to s and N is the surface
normal, (2), expressed along $\Gamma$ as a function of s. Clearly, all three functions
$V_i(s)$ are of class $C^1$. Hence the three derived vectors, $V_i'$, exist, are
continuous, and are linear combinations, with continuous scalar coefficients, of
the three vectors $V_i$. These coefficients can be calculated as is usual for
all “derivation formulae.” This leads to the well-known “geodesic Frenet
equations,”

(34) $V_1' = \alpha V_2 - \beta V_3$, $V_2' = -\alpha V_1 + \gamma V_3$, $V_3' = \beta V_1 - \gamma V_2$,

where

(35) $\alpha = X''\cdot N$, $\beta = \det(X', X'', N)$, $\gamma = \det(X', N, N')$.

The latter $\gamma$ is identical with the $\gamma$ in (5), which in Section 1 was defined
to be the geodesic torsion. Correspondingly, the second of the relations (35)
shows that $\beta$ is identical with the classical (“embedded”) definition of the




geodesic curvature. Finally, it is seen from (4) that the first of the relations
(35) can be written in the form

(36) $\alpha = -X'\cdot N'$; hence $\alpha = h_{ik}u^{i\prime}u^{k\prime}$,

by (3) (so that $\alpha$ is the normal curvature). Thus $\alpha = 0$ is equivalent to (8),
which is the definition of an asymptotic direction (if any; that is, if $K \leq 0$).

If points of $\Gamma$ at which $|X''|$ may vanish are excluded, then the set (33)
(in which the assumption $|X''| > 0$ is not needed) can be paralleled by the
set consisting of the unit vectors of the tangent, principal normal, and
binormal, of $\Gamma$, that is, by the set

(37) $U_1 = X'$, $U_2 = |X''|^{-1}X''$, $U_3 = |X''|^{-1}(X', X'')$.

Then the “geodesic” Frenet equations, (34), become replaced by the ordinary
Frenet equations,

(38) $U_1' = \kappa U_2$, $U_2' = -\kappa U_1 + \tau U_3$, $U_3' = -\tau U_2$,

and, correspondingly, the data (35) by the (ordinary) curvature and the
(ordinary) torsion,

(39) $\kappa = |X''| > 0$ and $\tau = \det(X', X'', X''')/\kappa^2$,

provided that $\Gamma$: $X = X(s)$ (instead of being, as before, just of class $C^2$)
is of class $C^3$. But the definitions (37) and the first of the relations (38),
where $\kappa = |X''| > 0$, do not require the latter proviso, and imply, in view
of (37), the identities

(40) $U_1 = V_1$, $U_2 = V_2\cos\omega + V_3\sin\omega$, $U_3 = -V_2\sin\omega + V_3\cos\omega$,

if the (continuous) angular function $\omega = \omega(s)$ is defined (mod $2\pi$) by

(41) $\cos\omega = N\cdot U_2$, $\sin\omega = -\det(U_1, U_2, N)$.

If this is compared with (37) and (38), it follows that

(42) $\alpha = \kappa\cos\omega$, $\beta = -\kappa\sin\omega$.

Since the definitions of $\alpha$, $\beta$ and $\kappa$ imply that $\alpha = 0 = \beta$ if $\kappa = 0$, both
equations (42) hold for $\kappa = 0$ also, provided that the angle $\omega$, which the
case $\kappa = 0$ of (41) leaves undefined, is considered as arbitrary.

It will now be supposed that $\kappa > 0$, and that $\Gamma$ is of class $C^3$. Then (38)
is applicable and, in view of (41), the continuous function $\omega = \omega(s)$ is of
class $C^1$, as is $\kappa = \kappa(s)$. Hence, if (40) is differentiated, and if the result
is compared with (34) and (38), it follows that

(43) $\omega' = \tau - \gamma$.
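
The comparison leading to (43) can be written out as follows. This is a sketch only; it assumes that the geodesic Frenet equations (34) read $V_2' = -\alpha V_1 + \gamma V_3$, $V_3' = \beta V_1 - \gamma V_2$, that (40) gives $U_2 = V_2\cos\omega + V_3\sin\omega$ and $U_3 = -V_2\sin\omega + V_3\cos\omega$, and that (42) gives $\alpha = \kappa\cos\omega$, $\beta = -\kappa\sin\omega$.

```latex
\begin{align*}
U_2' &= V_2'\cos\omega + V_3'\sin\omega
        + \omega'\,(-V_2\sin\omega + V_3\cos\omega) \\
     &= (-\alpha V_1 + \gamma V_3)\cos\omega
        + (\beta V_1 - \gamma V_2)\sin\omega + \omega'\,U_3 \\
     &= -(\alpha\cos\omega - \beta\sin\omega)\,V_1
        + \gamma\,(-V_2\sin\omega + V_3\cos\omega) + \omega'\,U_3 \\
     &= -\kappa\,U_1 + (\gamma + \omega')\,U_3 .
\end{align*}
```

Comparison with the ordinary Frenet equation $U_2' = -\kappa U_1 + \tau U_3$ gives $\tau = \gamma + \omega'$, that is, $\omega' = \tau - \gamma$.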


A corollary of (36), (42) and (43) is the following pair of facts: 




(IV) Let S be a surface, of class $C^3$, having no elliptic points (so that
$K \leq 0$ on S), and let $\Gamma$ be a curve, of class $C^2$, on S. Then $\Gamma$ is an asymptotic
curve if and only if $\kappa(s) = |\beta(s)|$, where $\kappa$ ($\geq 0$) is the curvature, and $\beta$
the geodesic curvature, of $\Gamma$. If, in addition, $\kappa(s) \neq 0$ holds on an asymptotic
curve $\Gamma$, then $\tau(s) = \gamma(s)$, where $\tau(s)$ is the torsion, and $\gamma(s)$ the geodesic
torsion, of $\Gamma$.


Since S is supposed to be of class $C^3$, the asymptotic curves are of class
$C^2$. Hence, the classical definition, (39), of $\tau$ will not in general apply
(cf. [6], p. 773). But if the torsion, $\tau$, is defined as in [6], pp. 770-772,
then the asymptotic curve has a torsion at all those points s at which $\kappa(s) \neq 0$.
This, and only this, makes meaningful the second assertion of (IV).

Of course, the converse of the second assertion of (IV) is false, that is,
the assumption $\tau(s) = \gamma(s)$ along a curve of class $C^2$ (with non-vanishing
curvature) does not imply that the curve is an asymptotic curve. In fact,
this identity holds, under the assumption $\kappa(s) > 0$, if and only if $U_2(s)\cdot N(s)$
= const. on $\Gamma$ (a condition which is satisfied if, for instance, $\Gamma$ is an
asymptotic curve or a geodesic).

Let s be fixed. Then, if $\kappa(s) = |X''(s)|$ vanishes, (35) shows that
$\alpha(s) = \beta(s) = 0$. On the other hand, if $\kappa(s) > 0$, then the first of the
relations (42) shows that $\alpha(s) \equiv 0$ is equivalent to $\cos\omega(s) \equiv 0$, which
implies that $\omega(s)$ is an odd multiple of $\pi/2$, and that $\omega'(s)$ therefore exists and is 0
(in a vicinity of the fixed s). In other words, $\alpha(s) \equiv 0$ implies that
$\kappa = |\beta|$, by the second part of (42), and that $\tau = \gamma$ if $\kappa \neq 0$, by (43);
conversely, $\kappa = |\beta|$ implies that $\alpha = 0$. This completes the proof of (IV),
since, in view of the remark made after (36), the differential equation of the
asymptotic curves is $\alpha = 0$.
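
The equivalence of $\alpha = 0$ and $\kappa = |\beta|$ used in this proof rests on a single identity, which follows from the two relations $\alpha = \kappa\cos\omega$, $\beta = -\kappa\sin\omega$ (and holds trivially when $\kappa = 0$). A sketch:

```latex
\begin{align*}
\alpha^2 + \beta^2 &= \kappa^2\cos^2\omega + \kappa^2\sin^2\omega = \kappa^2, \\
\alpha = 0 \;&\Longleftrightarrow\; \beta^2 = \kappa^2
            \;\Longleftrightarrow\; |\beta| = \kappa \qquad (\kappa \geq 0).
\end{align*}
```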


10. On a formula of Bonnet. It will now be easy to formulate a precise
wording of Bonnet’s relation (cf. [3], pp. 397-399), referred to at the
beginning of Section 9.


(V) Let S be a surface of class $C^3$ having no elliptic points (so that
$K \leq 0$), and let $\Gamma$ be a curve on S which is of class $C^3$ and has, at some
point P, an asymptotic direction and a non-vanishing curvature. Then

(44) $\kappa(3\tau_0 - \tau) = \pm 2\tau_0\kappa_0$, ($\kappa > 0$, $\kappa_0 \geq 0$),

where $\kappa$ and $\tau$ denote the curvature and the torsion, and $\pm\kappa_0$ and $\tau_0 = \pm(-K)^{1/2}$
the geodesic curvature and the geodesic torsion, of $\Gamma$ at P (so that, if $\Gamma_0$ denotes




the asymptotic curve tangent to $\Gamma$ at P, then $\kappa_0$ is the curvature and, if $\kappa_0 > 0$,
then $\tau_0$ is the torsion of $\Gamma_0$ at P; cf. (IV) above).

When $\kappa\kappa_0 > 0$, the alternative sign ($\pm$) in (44) depends on $\Gamma$; in fact,
it will be clear from the proof of (44) that the + or the − holds according
as the principal normals of $\Gamma$ and $\Gamma_0$ have common or opposite directions.

If P is a parabolic point of S, then any curve $\Gamma$ (of class $C^3$) through P
having an asymptotic direction possesses, at P, either a vanishing curvature $\kappa$
or a vanishing torsion $\tau$, since $\kappa > 0$ and $\tau_0 = \pm(-K)^{1/2} = 0$ imply that
$\tau = 0$ in (44).

The proof of the last italicized statement, (V), proceeds as follows: 

First, a multiplication of the two determinants in (35) shows that $\beta\gamma$
can be written as the determinant in which the first row is $1$, $0$, $X'\cdot N'$, the
second $0$, $X''\cdot N$, $X''\cdot N'$, and the third $0$, $1$, $0$. Hence $\beta\gamma = -X''\cdot N'$. Since
differentiation of the first of the relations (36) gives $\alpha' = -X''\cdot N' - X'\cdot N''$,
it follows that

(45) $\alpha' - 2\beta\gamma$

is identical with $X''\cdot N' - X'\cdot N''$.
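
The multiplication of determinants referred to here can be displayed explicitly. The sketch below uses only $X'\cdot X' = 1$, $N\cdot N = 1$, $X'\cdot N = 0$ and their consequences $X'\cdot X'' = 0$, $N\cdot N' = 0$; the entry in row $i$, column $k$ is the scalar product of the $i$-th vector of $(X', X'', N)$ with the $k$-th vector of $(X', N, N')$.

```latex
\begin{align*}
\beta\gamma &= \det(X', X'', N)\,\det(X', N, N') \\
&= \det\begin{pmatrix}
     X'\cdot X' & X'\cdot N & X'\cdot N' \\
     X''\cdot X' & X''\cdot N & X''\cdot N' \\
     N\cdot X' & N\cdot N & N\cdot N'
   \end{pmatrix}
 = \det\begin{pmatrix}
     1 & 0 & X'\cdot N' \\
     0 & X''\cdot N & X''\cdot N' \\
     0 & 1 & 0
   \end{pmatrix}
 = -X''\cdot N', \\
\alpha' - 2\beta\gamma &= (-X''\cdot N' - X'\cdot N'') + 2\,X''\cdot N'
 = X''\cdot N' - X'\cdot N'' .
\end{align*}
```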


Since the curve $\Gamma$: $X = X(s) = X(u^1(s), u^2(s))$ is on the surface
S: $X = X(u^1, u^2)$, differentiations of (1) and (2) with respect to s show
that $Z' = Z_i u^{i\prime}$ and $Z'' = Z_{ik}u^{i\prime}u^{k\prime} + Z_i u^{i\prime\prime}$ hold for $Z = X$ and for $Z = N$
(the subscripts denote partial differentiation with respect to $u^i$, $u^k$). Hence,
the expression (45) is the sum of

(46) $(X_{ij}\cdot N_k - X_k\cdot N_{ij})\,u^{i\prime}u^{j\prime}u^{k\prime}$

and of the bilinear form $h_{ik}(u^{i\prime}u^{k\prime\prime} - u^{i\prime\prime}u^{k\prime})$, where $h_{ik}$ is the scalar product
defined by (3) or (4). This bilinear form vanishes identically, since $h_{ik} = h_{ki}$.
Consequently, the function (45) or (46) of s depends only on the point
P: $X(s)$ and on the direction, $X'(s)$, of $\Gamma$ at P.

The identity of the two values (45), (46) was just derived under the
hypothesis that $\Gamma$ is of class $C^2$. If $\Gamma$ is of class $C^3$, so that $\kappa(s)$ and $\omega(s)$
are of class $C^1$, then (42) shows that (46) can be written in the form

(47) $\kappa'\cos\omega - (\omega' - 2\gamma)\kappa\sin\omega$.


Let this be applied to both $\Gamma_0$ and $\Gamma$, where $\Gamma_0$ is an asymptotic curve
through a point, P, of S, and $\Gamma$ is a curve, of class $C^3$ and of non-vanishing
curvature, which is on S and is tangent to $\Gamma_0$ at P. In view of the identity
of the numbers (45), (46), it is seen that (45) attains, at P, the same




value for as for Ty. Hence, if x,w,y,: and Ko, wo, refer to T 
and Ip, respectively, then the value of the expression (45) at the point P is 


(48) — By = koro SiN wo, (sin w = + 1), 
since a(s) = 0, hence a’(s) =0, on Ty. 


On the other hand, since $\Gamma$ is of class $C^3$, another expression for the
value of (45) at $P$ is given by (47). In view of (43), and since $\gamma = \tau_0$,
the identity of the values (47), (48) means that

(49) $\kappa'\cos\omega + (3\tau_0 - \tau)\kappa\sin\omega = 2\kappa_0\tau_0\sin\omega_0$.

Finally, since $\Gamma$ has an asymptotic direction at $P$, it follows that $\alpha(s) = 0$
at $P$. Hence, the first relation of (42) and the assumption $\kappa(s) \neq 0$ show
that $\cos\omega = 0$, $\sin\omega = \pm 1$. Consequently, (44) follows from (49).


11. On a formula of Beltrami. Beltrami's theorem, mentioned at the
end of Section 6, states that if $\Gamma$ is a branch of the intersection, $ST$, of $S$ and
of the plane, $T$, tangent to $S$ at a hyperbolic point, $P$, of $S$, then, in the
notations of (V) above,

(50) $3\kappa = 2\kappa_0$

(cf. [3], p. 398). Formally, this result of Beltrami is a consequence of
Bonnet's theorem, since (44) reduces to (50) if $\tau = 0$ (in fact, since
$\tau_0 = \pm(-K)^{1/2}$, hence

(51) $\tau_0 \neq 0$

at a hyperbolic point, division by $\tau_0$ is allowed).


Actually, this deduction of (50) from (44) is not legitimate under the
assumptions of (V). For, on the one hand, (50) is claimed also for the
case $\kappa = 0$, excluded in (38), and, on the other hand, (V) assumes that $\Gamma$
is of class $C^3$, whereas, corresponding to (iii) in (§), Section 8, the curve $\Gamma$
need not be of class $C^3$ under the $C^3$-assumption made in (V) for $S$. It will,
however, be shown that the proof of (V) can be adjusted to the case of
Beltrami's theorem so as to dispose of the difficulties on both of these accounts;
so that (50) holds in its full generality:


(VI) Let $P$ be a hyperbolic point of a surface $S$ of class $C^3$, and let $\Gamma$
denote a branch of the intersection $ST$, where $T$ is the plane tangent to $S$
at $P$. Then $\Gamma$ has at $P$ a (continuous) curvature, $\kappa \geq 0$, and (50) holds for
the curvature, $\kappa_0 \geq 0$, of the asymptotic curve tangent to $\Gamma$ at $P$.




In order to prove this assertion, (VI), suppose first that $\kappa(s)$, the
curvature of $\Gamma$, vanishes on a set of points which cluster at $P$. It then
follows from (42) that, for reasons of continuity, $\kappa = \pm\beta = 0$ at $P$. On the
other hand, it was shown in Section 10 that $\alpha'$ exists even under the present
assumptions, and so it is seen from (42) that $\alpha'$ vanishes at $P$. Consequently,
the same is true of the expression (45), and therefore of the expression (46)
as well, and so of (48). But the vanishing of (48) at $P$ is equivalent to
$\kappa_0 = 0$, since (51) is satisfied (cf. the corresponding conclusion in Section 10).
Hence (50) is true in the present case.

In the remaining case, $\kappa(s)$ does not vanish near $P$ (it may or may not
vanish at $P$), hence (41) defines $\omega(s)$ near $P$. It follows therefore from an
obvious variant of the lemma, (§), of Section 8 that $\Gamma$ is of class $C^3$ near $P$.
But $\tau(s) \equiv 0$, since $\Gamma = ST$ is a plane curve. Thus it is clear from (43)
that (whether $\kappa(s)$ does or does not vanish at $P$) it is possible to define $\omega$
(at $P$) in such a way that $\omega(s)$ remains of class $C^1$ after the inclusion of $P$;
in fact,

(52) $\omega' = -\gamma = -\tau_0$ at $P$.
On the other hand, since $\Gamma$ is in the plane tangent to $S$ at $P$, it is clear
that, near $P$, the vector product $(U_1, U_2)$ has the constant direction of the
normal to $S$ at $P$ (note that the principal normal, $U_2$, is undefined at $P$ if
$\kappa = 0$). It follows therefore from (41) that

(53) $\sin\omega = \pm 1$ and $\cos\omega = 0$ at $P$,

and it is clear from (53) that $\alpha(s) = \kappa(s)\cos\omega(s)$ is differentiable at $P$,
having the derivative

(54) $\alpha' = 0 + \kappa(\cos\omega(s))'_P = -\kappa\omega'\sin\omega$ at $P$

(even though $\kappa(s)$ may not be differentiable at $P$). From now on, all statements
will refer to the point $P$.

Since (54), (52) and the second of the relations (42) imply that
$\alpha' = \kappa\tau_0\sin\omega = -\beta\gamma$, the value of the expression (45) is $3\kappa\tau_0\sin\omega$. On
the other hand, the expression (45) is identical with (46) as well as with
(48). Consequently, the product on the right of (48) must have the value
$3\kappa\tau_0\sin\omega$, just found. In view of (51), this means that $2\kappa_0\sin\omega_0 = 3\kappa\sin\omega$,
which, according to (53) and the parenthetical alternative in (48), implies
that $|2\kappa_0| = |3\kappa|$. This proves (50), since both curvatures $\kappa$, $\kappa_0$ are
non-negative in (VI).
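
As a sanity check of (50), the two curvatures can be computed by hand on a
concrete surface. The sketch below uses the surface $z = xy + ax^3$, which is
an illustrative choice and not taken from the paper; the function names are
likewise hypothetical. The origin is a hyperbolic point, the tangent plane
there is $z = 0$, and the derivative vectors in the code come from the closed
forms stated in the docstring.

```python
import math

def curvature(d1, d2):
    """Curvature |r' x r''| / |r'|^3 of a space curve from its first two derivatives."""
    cx = d1[1] * d2[2] - d1[2] * d2[1]
    cy = d1[2] * d2[0] - d1[0] * d2[2]
    cz = d1[0] * d2[1] - d1[1] * d2[0]
    speed = math.sqrt(sum(c * c for c in d1))
    return math.sqrt(cx * cx + cy * cy + cz * cz) / speed ** 3

def beltrami_pair(a):
    """Curvatures (kappa, kappa0) at the origin for the surface z = x*y + a*x**3.

    The tangent plane z = 0 meets the surface in x*(y + a*x**2) = 0; the curved
    branch is y = -a*x**2, z = 0. The asymptotic curve tangent to it solves
    3*a*x*dx + dy = 0, i.e. y = -(3/2)*a*x**2, z = x*y + a*x**3 = -(1/2)*a*x**3.
    Both curves are parametrized by x; derivatives are evaluated at x = 0.
    """
    section_d1, section_d2 = (1.0, 0.0, 0.0), (0.0, -2.0 * a, 0.0)
    asymptotic_d1, asymptotic_d2 = (1.0, 0.0, 0.0), (0.0, -3.0 * a, 0.0)
    return curvature(section_d1, section_d2), curvature(asymptotic_d1, asymptotic_d2)

kappa, kappa0 = beltrami_pair(0.7)
print(3 * kappa, 2 * kappa0)  # the two printed numbers should agree
```

The other branch of the section, the line $x = 0$, has $\kappa = 0$ and is itself
an asymptotic curve, so (50) holds there trivially.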


THE JOHNS HOPKINS UNIVERSITY.




REFERENCES. 


[1] S. Cohn-Vossen, "Die parabolische Kurve," Mathematische Annalen, vol. 99 (1928),
pp. 273-308.

[2] S. Cohn-Vossen, Zentralblatt für Mathematik, vol. 9 (1934), pp. 373-374.

[3] G. Darboux, Leçons sur la théorie générale des surfaces, vol. 2 (1889).

[4] P. Franklin, "Regions of positive and negative curvature on closed surfaces,"
Journal of Mathematics and Physics, vol. 13 (1934), pp. 253-260.

[5] J. Hadamard, "Sur l'itération et les solutions asymptotiques des équations
différentielles," Bulletin de la Société Mathématique de France, vol. 29 (1901),
pp. 224-228.

[6] P. Hartman and A. Wintner, "On the fundamental equations of differential
geometry," American Journal of Mathematics, vol. 72 (1950), pp. 757-774.

[7] P. Hartman and A. Wintner, "On the asymptotic curves of a surface," ibid.,
vol. 73 (1951), pp. 149-172.

[8] E. R. van Kampen, "A remark on asymptotic curves," ibid., vol. 61 (1939),
pp. 992-994.

[9] W. Rinow, Jahrbuch über die Fortschritte der Mathematik, vol. 60 (1934),
pp. 613-614.

[10] A. Wintner, "On isometric surfaces," American Journal of Mathematics, vol. 74
(1952), pp. 198-214.




ON THE THEORY OF GEODESIC FIELDS.* 


By PHILIP HARTMAN and AUREL WINTNER.


1. Geodesics. On a sufficiently small $(u, v)$-domain, say on

(1) $\mathfrak{G}_a$: $u^2 + v^2 < a^2$,

consider the Riemannian geometry defined by a line element

(2) $ds^2 = E(u, v)\,du^2 + 2F(u, v)\,du\,dv + G(u, v)\,dv^2$,

which is positive definite (i.e.,

(3) $EG - F^2 > 0$ and $E > 0$,

hence $G > 0$), but such that the functions $E$, $F$, $G$ of $(u, v)$ are just
continuous. A geodesic $\Gamma$ must then be defined as a Jordan arc contained in $\mathfrak{G}_a$,
and having the property that, if $P$ and $Q$ are the end points of $\Gamma$, and if $\Lambda$
is any Jordan arc joining $P$ to $Q$ within $\mathfrak{G}_a$, then

$\int_\Gamma |ds| \leq \int_\Lambda |ds|$.


If $P$ and $Q$ are close enough to the center, $(0, 0)$, of the $(u, v)$-domain
(1), then, according to Hilbert ([9]; cf. [2], pp. 419-438), there exists a
geodesic $\Gamma = \Gamma(P, Q)$, which is a rectifiable Jordan arc. (Actually, Hilbert
assumes that the line element (2) is embedded into a Euclidean $(x, y, z)$-space
as the $dx^2 + dy^2 + dz^2$ on a surface of class $C^1$, but this assumption is nowhere
used in his proof.)


One of the difficulties is that, no matter how close $Q$ be chosen to a
fixed $P$, the geodesic joining $Q$ to $P$ need not be unique, not even if the
coefficient functions of (2), instead of being just continuous, are of class $C^1$;
cf. [7], pp. 132-133. If they are of class $C^1$, then the Christoffel coefficients
$\Gamma^i_{jk} = \Gamma^i_{jk}(u, v)$ exist and are continuous, and, as pointed out in [7],
pp. 134-135, all geodesics must satisfy the standard differential equations
$u^{i\prime\prime} + \Gamma^i_{jk}u^{j\prime}u^{k\prime} = 0$, where $(u^1, u^2) = (u, v)$.

Since the coefficient functions of (2) will not be required to possess
derivatives, no differential equations for the geodesics will be available.
Nevertheless, there will be proved several central theorems of the classical theory
(a theory the methods of which assume the coefficient functions of (2) to be
of class $C^2$, at least, and often even smoother). The theorems in question are
those of Gauss on orthogonal trajectories, the related theorem of Jacobi
concerning multipliers, Riemann's invariant relations which are both necessary
and sufficient (Herglotz) for normal geodesic coordinates, and Beltrami's
characterization of the non-Euclidean and spherical metrics as geodesic maps
of the Euclidean geometry.

* Received November 12, 1951.

Even if the coefficient functions of (2) were assumed to be of class $C^1$,
the metric defined by (2) would not, in general, have a Gaussian curvature
$K = K(u, v)$ (in fact, the classical definition of $K$ applies only if $E$, $F$, $G$
are of class $C^2$). In particular, Jacobi's equation of the normal displacements,
$d^2n/ds^2 + K(s)n = 0$, which defines conjugate points, is not available.

In problems dealing with the geodesics of a metric (2) in which $E$, $F$, $G$
are just continuous, a complication is presented by the circumstance that a
geodesic arc $\Gamma$ cannot be assumed to be a curve of class $C^1$ or, for that matter,
such as to possess a tangent at each of its points (rather than just almost
everywhere, $\Gamma$ being rectifiable). Actually, these questions were left undecided
in [7], pp. 144-148. (It was shown there, among other things, that every $\Gamma$
must possess the "Archimedean property"; and while it is easy to see that
this property of a rectifiable Jordan arc does not imply the existence of a
tangent at every point of the arc,¹ it seems to be less easy to "embed" such
an arc into a metric (2) for which the arc becomes a geodesic.) Fortunately,
these matters will not complicate the situation, since the nature of all the
problems considered is such as to imply the $C^1$-character of those particular
(even though possibly not of all) geodesics $\Gamma$ of a metric (2) to which the
assertions of the theorems refer.


¹ In order to obtain such a Jordan arc in a $(u, v)$-plane, it is sufficient to put
$u = r\cos\phi$, $v = r\sin\phi$, and to assign the Jordan arc in the parametric form $r = f(\phi)$,
where $f(\phi)$, with $f(\phi_1) \neq f(\phi_2)$ when $\phi_1\phi_2$ is positive, is of class $C^1$ for
$-\infty < \phi < \infty$ and tends sufficiently fast to 0 as $\phi \to \pm\infty$ (so that the point
$(u, v) = (0, 0)$ of the Jordan arc belongs to $\phi = -\infty$ and $\phi = \infty$). In fact, an easy
calculation shows that the choice $f(\phi) = \exp(-\phi^2)$ will do (the slower logarithmic
spiral, $f(\phi) = \exp(-|\phi|)$, will not do; the non-differentiability of the latter
function at $\phi = 0$ is of course immaterial).


2. Transversals. The principal results will depend on (the wording or
the proof of) a lemma concerning transversal fields; cf. [6], pp. 147-151.
Under the classical assumptions of differentiability, the assertion of the lemma
is nothing but a theorem of Jacobi, and is then a corollary of the theorem
of Gauss concerning the transversal trajectories of a sheaf of geodesics (cf.
[3], vol. II, p. 430). But the point is that, under the general assumption (0)
below, the classical proofs must fail to apply. The lemma in question is as
follows:


LEMMA 1. Suppose that

(0) $E$, $F$, $G$ in (2) are continuous functions satisfying (3) on a
sufficiently small domain (1), and that

there exists, on that domain, a function, say $\phi(u, v)$, which is of class $C^1$
and has the following properties:

(i) all solutions of the differential equation

(4) $dv/du = \phi(u, v)$

represent geodesic arcs, $\Gamma$, of (2) and

(ii) if a geodesic arc, $\Gamma$, of (2) possesses a tangent at some point and
has, at that point, the same direction as a solution path of (4), then the latter
solution path is identical with the arc $\Gamma$.

Under these assumptions, there exist on (1) positive, continuous multipliers
$\mu = \mu(u, v)$ for the Pfaffian

(5) $(E + F\phi)\,du + (F + G\phi)\,dv$;

in fact, the function

(6) $\mu = (E + 2F\phi + G\phi^2)^{-1/2}$

(which, in view of (0), is continuous and positive) is such a multiplier.


In other words, there exists on $\mathfrak{G}_a$: $u^2 + v^2 < a^2$ a function $r = r(u, v)$
(unique to an additive constant, say to its value $r(0, 0)$ at the center of $\mathfrak{G}_a$)
such that $r$ possesses on $\mathfrak{G}_a$ the partial derivatives

(7) $r_u = \mu(E + F\phi)$, $r_v = \mu(F + G\phi)$,

where $\mu$ denotes the function (6).
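
The content of (6) and (7) can be checked numerically in the simplest smooth
case. The sketch below is an illustration only, not part of the paper: it
takes the Euclidean metric $E = G = 1$, $F = 0$ and the field of straight
lines through a hypothetical point `(U0, V0)` lying to the left of the domain,
and compares finite-difference derivatives of the Euclidean distance from
that point with the right-hand sides of (7). All names are assumptions made
for the example.

```python
import math

U0, V0 = -2.0, 0.5  # hypothetical centre of the pencil of lines, outside the domain

def phi(u, v):
    """Slope field dv/du of the straight lines through (U0, V0)."""
    return (v - V0) / (u - U0)

def metric(u, v):
    """Euclidean line element: E = 1, F = 0, G = 1."""
    return 1.0, 0.0, 1.0

def mu(u, v):
    """The multiplier (6): (E + 2 F phi + G phi^2)^(-1/2)."""
    E, F, G = metric(u, v)
    p = phi(u, v)
    return 1.0 / math.sqrt(E + 2.0 * F * p + G * p * p)

def r(u, v):
    """Candidate potential: Euclidean distance from (U0, V0)."""
    return math.hypot(u - U0, v - V0)

def residuals(u, v, h=1e-6):
    """Finite-difference r_u, r_v minus the right-hand sides of (7)."""
    E, F, G = metric(u, v)
    p, m = phi(u, v), mu(u, v)
    r_u = (r(u + h, v) - r(u - h, v)) / (2.0 * h)
    r_v = (r(u, v + h) - r(u, v - h)) / (2.0 * h)
    return r_u - m * (E + F * p), r_v - m * (F + G * p)

print(residuals(0.1, -0.2))  # both entries should be close to zero
```

In this example the level curves of $r$ are circles about `(U0, V0)`, which
are the orthogonal trajectories of the pencil, in line with the theorem of
Gauss that the lemma generalizes.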
Since the vanishing of the Pfaffian (5), i.e., the differential equation

(8) $(E + F\phi)\,du + (F + G\phi)\,dv = 0$

for $v = v(u)$ or $u = u(v)$, characterizes the transversals of the sheaf of curves
defined by the differential equation

(9) $dv - \phi\,du = 0$,

the assertion of Lemma 1 implies that the differential equation of the
transversals of the geodesic sheaf defined by (4) possesses some (continuous,
non-vanishing) integrating factor, $\mu = \mu(u, v)$. Not even this is obvious, since,
if nothing but (0) in Lemma 1 is assumed, and if (5) is written in the form
$\omega = M(u, v)\,du + N(u, v)\,dv$, where $M$ and $N$ are just continuous, then the
Pfaffian $\omega$ need not possess a multiplier in any sense; cf. [12], Section 7.
Correspondingly (and in view of Section 7 below), it turns out that the
content of Lemma 1 is substantially equivalent to the following statement:


LEMMA 2. Under the assumptions of Lemma 1 (and if $a$ in $\mathfrak{G}_a$:
$u^2 + v^2 < a^2$ is small enough), there exists on $\mathfrak{G}_a$ a pair of functions,

(10) $u^* = u^*(u, v)$, $v^* = v^*(u, v)$,

such that the transformation (10) is of class $C^1$ and of non-vanishing Jacobian,
and maps the neighborhood $\mathfrak{G}_a$ of $(u, v) = (0, 0)$ on a neighborhood of
$(u^*, v^*) = (0, 0)$ in such a way that the metric (2) acquires a "geodesic"
form

(11) $ds^2 = du^{*2} + g(u^*, v^*)\,dv^{*2}$,

where $g$ is a positive, continuous function.

Lemma 1 has a partial converse, as follows:


LEMMA 3. Suppose that (2) satisfies (0) in Lemma 1, and that a function
$\phi(u, v)$, which is of class $C^1$ on $\mathfrak{G}_a$, has the property that the Pfaffian
(5) possesses the multiplier (6) on $\mathfrak{G}_a$. Then every solution of (4) represents
a geodesic of (2).


It remains undecided whether the $C^1$-assumption imposed on $\phi(u, v)$ in
Lemmas 1-3 can be reduced to some extent (for instance, to the extent of
requiring only that $\phi(u, v)$ be continuous and such that the solutions of (4),
belonging to an initial condition, are unique).


3. Reduction to parallel segments. By adapting a scheme from analytical
mechanics (in a highly differentiable case), it will be convenient to arrange
the proofs in such a way that the sheaf of geodesics which is defined by (4)
is assumed to be a sheaf of line segments

(12) $v = \text{const.}$,

which means that

(13) $\phi(u, v) \equiv 0$

in (4). It turns out that it can be assumed that, besides (13),

(14) $F(0, v) \equiv 0$.

In order to make applicable the simplification afforded by this scheme,
it will first be proved that, under the assumptions of Lemma 1, 2 or 3, there
always exists an admissible $(u, v)$-transformation of class $C^1$ which leaves
the assumptions unchanged and leads to the normalizations (13)-(14).


Proof. After an affine transformation of (1) (and if $a$ in the new
$\mathfrak{G}_a$: $u^2 + v^2 < a^2$ is small enough), it can be assumed that the coefficient
matrix of (2) is the unit matrix at the center of $\mathfrak{G}_a$,

(15) $E(0, 0) = 1$, $F(0, 0) = 0$, $G(0, 0) = 1$.

Then the function $E + F\phi$ of $(u, v)$, being $1 + 0\cdot\phi(0, 0) > 0$ at the center
of $\mathfrak{G}_a$, will satisfy

(16) $E + F\phi > 0$ on $\mathfrak{G}_a$

if $a$ is small enough. Hence the differential equation (8) can be written in
the form $u' = f(u, v)$, where $f$ is continuous on $\mathfrak{G}_a$ and the prime denotes
$d/dv$. Consequently (8) has, for small $|v|$, say for $|v| < b$, at least one
solution $u = u(v)$ satisfying $u(0) = 0$. Let $y(v)$ denote such a solution
of (8). Thus

(17) $u = y(v)$, where $-b < v < b$ and $y(0) = 0$.


On the other hand, since $\phi(u, v)$ is of class $C^1$, the differential equation
(9), which is (4), has a unique solution $v = v(u) = v(u; u_0, v_0)$ satisfying
$v(u_0) = v_0$, whenever $(u_0, v_0)$ is close enough to $(0, 0)$, and the function
$v(u; u_0, v_0)$ exists and is of class $C^1$ in its three arguments together, if $(u_0, v_0)$
is restricted to a sufficiently small circle about $(0, 0)$, and $u$ to a sufficiently
short interval $-c < u < c$ (the length of which is independent of $u_0$, $v_0$).

In terms of the function $v(u; u_0, v_0)$ and of the function $y(v)$ occurring
in (17), define, for small $|\alpha|$, $|\beta|$, two functions, $U$ and $V$, by placing

(18) $U(\alpha, \beta) = \alpha + y(\beta)$, $V(\alpha, \beta) = v(\alpha + y(\beta); y(\beta), \beta)$,

and consider the transformation

(19) $u = U(\alpha, \beta)$, $v = V(\alpha, \beta)$.

This mapping of a neighborhood of $(\alpha, \beta) = (0, 0)$ on a neighborhood of
$(u, v) = (0, 0)$ is of class $C^1$ (since the functions $y(v)$, $v(u; u_0, v_0)$ occurring
in (18) are), and the Jacobian of the transformation (19) does not vanish at,
hence near, $(0, 0)$. In fact, the Jacobian $\partial(U, V)/\partial(\alpha, \beta)$ of (18) is $V_\beta - y'\phi$,
where

$V_\beta = \partial V/\partial\beta = (\partial v/\partial u)\,y' + (\partial v/\partial u_0)\,y' + \partial v/\partial v_0$.

Since $\partial v/\partial w_0$ at $(u; u_0, v_0) = (0; 0, 0)$ becomes 0 or 1 according as $w_0 = u_0$
or $w_0 = v_0$, it follows that the Jacobian at $(\alpha, \beta) = (0, 0)$ is $0 + 0 + 1 - 0 \neq 0$.

The meaning of the transformation (18)-(19) (which corresponds to
the "transformation to the rectilinear motion" in the Hamilton-Jacobi theory)
is as follows: For a given $(u_0, v_0)$, the solution path $v = v(u; u_0, v_0)$ of
(9) meets the solution path (17) of (8) at a unique point $(y(\beta_0), \beta_0)$.
The inverse of the transformation (18)-(19) is $(u_0, v_0) \to (\alpha_0, \beta_0)$, where
$\alpha_0 = u_0 - y(\beta_0)$. For a fixed $\beta$, the arc (19) is a solution path of (9).

It is seen either from this interpretation or from a direct calculation
that, if $(e, f, g)$ is the $(\alpha, \beta)$-representation of the covariant tensor $(E, F, G)$,
i.e., if (2) is identical with

$ds^2 = e(\alpha, \beta)\,d\alpha^2 + 2f(\alpha, \beta)\,d\alpha\,d\beta + g(\alpha, \beta)\,d\beta^2$

by virtue of the $C^1$-transformations (18)-(19), then the functions $e$, $f$, $g$ are
identical with

$E + 2F\phi + G\phi^2$, $(E + F\phi)y' + (F + G\phi)V_\beta$, $Ey'^2 + 2Fy'V_\beta + GV_\beta^2$,

respectively. It is clear from (18) that (4) is transformed into $d\beta/d\alpha = 0$,
which means that (13) will hold if $(\alpha, \beta)$ is called $(u, v)$. In addition, since
the arc $u = U(\alpha, \beta)$, $v = V(\alpha, \beta)$ becomes the transversal (17) when $\alpha = 0$,
condition (14) will be satisfied if $\alpha, \beta; f$ are called $u, v; F$. Finally, the
Pfaffian (5) and the function (6) are transformed into $\omega^* = e\,d\alpha + f\,d\beta$ and
$\mu^* = e^{-1/2}$, respectively. Since the property of a Pfaffian to be an exact
differential and the property of an arc to be a geodesic are invariant under a
local $C^1$-transformation $(u, v) \to (\alpha, \beta)$ of non-vanishing Jacobian, this proves
that the assumptions (13) and (14) will not involve a loss of generality in the
proof of Lemmas 1-3.


4. Proof of Lemma 1. In view of (13), the differential equations (9)
and (8) reduce to

(20) $dv/du = 0$

and

(21) $du/dv = -F(u, v)/E(u, v)$, $(E \neq 0)$,

respectively, and since the function (6) becomes $\mu = E^{-1/2}$, the pair of relations
(7) simplifies to

(22₁) $r_u = E^{1/2}(u, v)$; (22₂) $r_v = F(u, v)/E^{1/2}(u, v)$.

Hence the assertion of Lemma 1 is that, under the assumptions (0) and
(i)-(ii), there exists on $\mathfrak{G}_a$ a function $r = r(u, v)$ of class $C^1$ satisfying both
(22₁) and (22₂).




It will be proved that such an $r(u, v)$ is given by

(23) $r(u, v) = \int_0^u E^{1/2}(t, v)\,dt$.

It is clear that the function (23) is continuous and that its partial derivative
$r_u(u, v)$ exists and satisfies (22₁) (and is therefore continuous). What is not
clear is that the partial derivative $r_v(u, v)$ occurring in assertion (22₂) exists
at all; in fact, the function integrated in (23) is just continuous.

With reference to a fixed number $r$, consider the equation

(24) $r(u, v) = r$ $(= \text{const.})$.

In view of (23), this equation is satisfied at $(u, v) = (0, 0)$ if $r = 0$. It
follows therefore from (22₁), where $E \neq 0$, and from standard facts on
implicit functions, that (24) defines, in a neighborhood of $(v; r) = (0; 0)$,
a unique continuous function

(25) $u = u(v; r)$

in such a way that a triple $(u, v; r)$ sufficiently close to the triple $(0, 0; 0)$
will satisfy (24) if and only if the $u$ in $(u, v; r)$ is of the form (25).
Furthermore, since $E \neq 0$ in (22₁), the function (25), defined by (24), has a
continuous partial derivative with respect to $r$, and this derivative is given by

(26) $u_r(v; r) = E^{-1/2}(u, v)$, where $u = u(v; r)$.


It will be proved that

($\alpha$) there exists a positive number $d$ having the property that, if $r$ is any
fixed value for which $|r|$ is sufficiently small, then the continuous function
(25) exists for $|v| < d$ and represents a solution of (21).

If ($\alpha$) is granted, then the proof of Lemma 1 can be completed as
follows: ($\alpha$) implies that the function (25) has, in a neighborhood of
$(v; r) = (0; 0)$, a partial derivative with respect to $v$, and that this derivative
satisfies (21), i.e., that

(27) $u_v(v; r) = -F(u, v)/E(u, v)$, where $u = u(v; r)$,

which implies that this derivative is continuous in $v$ and $r$ together. Since
(24) and (25) are equivalent, it follows that the function (23) of $(u, v)$
is of class $C^1$. Finally, (22₂) follows from (25), (26) and (27).
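
The last step is the chain rule: differentiating $r(u(v; r), v) = r$ with
respect to $v$ gives $r_u u_v + r_v = 0$, hence $r_v = -E^{1/2}\cdot(-F/E) = F/E^{1/2}$.
The relation can also be observed numerically. The sketch below is an
illustration under stated assumptions, not the paper's construction: the
hypothetical coefficients $E = (1 + 2uv)^2$, $F = (1 + 2uv)u^2$ are the
pull-back of $dr^2 + dv^2$ under $r = u + u^2v$, so that the segments
$v = \text{const.}$ are geodesics and $F(0, v) = 0$ as in (14).

```python
import math

def E(u, v):
    """Hypothetical coefficient E of the example metric."""
    return (1.0 + 2.0 * u * v) ** 2

def F(u, v):
    """Hypothetical coefficient F of the example metric; note F(0, v) = 0."""
    return (1.0 + 2.0 * u * v) * u * u

def r(u, v, n=200):
    """Formula (23): integral of sqrt(E(t, v)) over 0 <= t <= u, by Simpson's rule."""
    h = u / n
    total = math.sqrt(E(0.0, v)) + math.sqrt(E(u, v))
    for k in range(1, n):
        total += (4.0 if k % 2 else 2.0) * math.sqrt(E(k * h, v))
    return total * h / 3.0

def check_22_2(u, v, h=1e-5):
    """Return (finite-difference r_v, F / sqrt(E)) at the point (u, v)."""
    r_v = (r(u, v + h) - r(u, v - h)) / (2.0 * h)
    return r_v, F(u, v) / math.sqrt(E(u, v))

print(check_22_2(0.3, 0.2))  # the two entries should agree
```

In this smooth example the existence of $r_v$ is not in question; the point of
the lemma is that (22₂) persists when $E$ is merely continuous, where the
integrand in (23) cannot be differentiated under the integral sign.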
In order to prove ($\alpha$), it will first be shown that

($\beta$) if a point $(u^0, v^0)$ is sufficiently near the point $(u, v) = (0, 0)$,
then the distance, $\int ds$, from the point $(u^0, v^0)$ to the line $u = 0$ is minimized
by the geodesic $v = v^0$.


5. Proof of ($\beta$). This will be the only part of the proof of Lemma 1
in which the assumption (ii) is used.

First, if $(u^0, v^0)$, where $u^0 \neq 0$, is a point close enough to the center
of $\mathfrak{G}_a$: $u^2 + v^2 < a^2$, then the existence of a rectifiable Jordan arc, say
$\Gamma = \Gamma(u^0, v^0)$, minimizing the distance $(= \int ds)$ of $(u^0, v^0)$ from the line
$u = 0$, follows by the same procedure as the existence of an arc minimizing
the distance between two given points which are close enough (Hilbert). If
there is more than one $\Gamma = \Gamma(u^0, v^0)$, choose one of them, and denote by
$(0, v^*)$ the point at which this $\Gamma$ reaches the line $u = 0$. It will be shown
that $\Gamma$ is transversal to the line $u = 0$ at the point $(0, v^*)$. Then the
assumption (ii) will assure that $\Gamma$ is the geodesic $v = v^*$; cf. (12), (13).
This in turn will imply that $v^* = v^0$ and that $\Gamma$ is, therefore, the geodesic
$v = v^0$, as claimed by ($\beta$).

Suppose if possible that $\Gamma = \Gamma(u^0, v^0)$ is not transversal to the line
$u = 0$ at the point $(0, v^*)$. Then $\Gamma$ either does not have a tangent at $(0, v^*)$
or, if it does, that tangent has a direction which fails to be transversal to
$u = 0$ (by a tangent is meant a unilateral tangent, since $\Gamma$ can be assumed
to end at the point $v = v^*$ of the line $u = 0$). Under either of these
hypotheses, it is seen from (14) and (21) that there exists on $\Gamma$ a sequence of
points $(u_1, v_1), (u_2, v_2), \cdots$ which converge to the point $(0, v^*)$ and have
the property that the inequality

(28) $|(v_n - v^*)/u_n| > c$

holds for a positive $c$ which is independent of $n$ $(= 1, 2, \cdots)$. Since $u^0 \neq 0$,
it can be assumed that $u^0 > 0$, and also that (28) is replaced by

(29) $v_n - v^* > c\,u_n > 0$

(the other possibilities can be treated similarly).


Accordingly, the proof of ($\beta$) will be complete if it is shown that the
existence of a $c > 0$ satisfying (29), where $(u_n, v_n) \to (0, v^*)$ as $n \to \infty$,
leads to a contradiction. The latter will result from the fact that (29) leads
to the following conclusion:

($\gamma$) If $n$ is large enough, then the distance $\int ds$ between the two points
$(0, v_n)$, $(u_n, v_n)$, when measured along the geodesic $v = v_n$, is shorter than
the distance $\int ds$ between the two points $(0, v^*)$, $(u_n, v_n)$, when measured
along $\Gamma(u^0, v^0)$.

In order to deduce the latter assertion, ($\gamma$), subject the $(u, v)$-plane to
an affine transformation $(u, v) \to (x, y)$ in such a way that (2) becomes
$ds^2 = dx^2 + dy^2$ at the point $(u, v) = (0, v^*)$. Then the distance $\int ds$ along
any geodesic which joins any two points is $(1 + o(1))d$ as those two points
tend to the point $(0, v^*)$, where $d$ denotes the Euclidean distance in the
$(x, y)$-plane between the two points; cf. [7], pp. 144-148. On the other
hand, it is clear that the Euclidean distance between the two points $(0, v^*)$,
$(u_n, v_n)$ is not less than $(1 + o(1))/\sin\alpha$ times the Euclidean distance
between the two points $(0, v_n)$, $(u_n, v_n)$, if, with reference to the positive
number $c$ occurring in (29), the $\alpha = \alpha_c$ in $1/\sin\alpha$, where $0 < \alpha < \tfrac{1}{2}\pi$, denotes
the angle between the $(x, y)$-images of the two lines $v - v^* = cu$, $u = 0$.
Since $1/\sin\alpha > 1$, and since $(u_n, v_n) \to (0, v^*)$ as $n \to \infty$, the preceding two
$o$-relations imply the truth of ($\gamma$).

The proof of ($\beta$) is now complete, since ($\gamma$) contains a contradiction.
In fact, if $n$ is large enough, and if $\Gamma_n$ denotes the path consisting of the
portion $0 \leq u \leq u_n$ of the geodesic $v = v_n$ and of that portion of $\Gamma = \Gamma(u^0, v^0)$
which joins $(u^0, v^0)$ to $(u_n, v_n)$, then ($\gamma$) implies that the length $\int ds$ of the
path $\Gamma_n$, a path joining a point of the line $u = 0$ to the point $(u^0, v^0)$, is
shorter than the length $\int ds$ of $\Gamma(u^0, v^0)$. But this contradicts the definition
of $\Gamma(u^0, v^0)$.


6. Proof of ($\alpha$). Suppose if possible that the assertion of ($\alpha$) is false.
Then there exists on the arc (24) or (25) a point $(u_0, v_0)$, arbitrarily close
to the point $(0, 0)$, in such a way that the arc either does not have a tangent
at $(u_0, v_0)$ or, if it does, that tangent has a direction which fails to be
transversal to the geodesic $v = v_0$. It can be assumed that $u_0 \neq 0$, say $u_0 > 0$.
For, if $r = 0$, then (23) shows that (25) is the arc $u = u(v; 0) \equiv 0$ which,
in view of (14), represents a solution of (21).

In order to simplify the notations, subject the $(u, v)$-plane to an affine
transformation after which the metric (2) reduces to $ds^2 = du^2 + dv^2$ at the
point $(u_0, v_0)$. Then, in both of the cases negating the truth of ($\alpha$), there
exists on the arc (24) or (25) a sequence of points $(u^1, v^1), (u^2, v^2), \cdots$
which converge to the point $(u_0, v_0)$ and have the property that

(30) $|v^n - v_0|/|u^n - u_0| < c$, where $|u^n - u_0| > 0$,

holds for a positive $c$ which is independent of $n$ $(= 1, 2, \cdots)$. Corresponding
to the reduction of (28) to (29), it can be assumed that (30) is replaced by

(31) $0 \leq (v^n - v_0)/(u_0 - u^n) < c$ $(n = 1, 2, \cdots)$.

With reference to the constant $c > 0$ occurring in (31), let $\Lambda = \Lambda(u_0, v_0; c)$
denote the line $v - v_0 = (-1/c)(u_0 - u)$. Then it is clear from $(u^n, v^n)
\to (u_0, v_0)$ that, if $n$ is large enough, the line $\Lambda$ will have on the geodesic
$v = v^n$ some point, say the point $(u_n, v^n)$.

If the results of [7], pp. 144-148, are applied in the same way as in
Section 5 above, it now follows that if $n$ is large enough, the distance $\int ds$
between the two points $(u_0, v_0)$, $(u_n, v^n)$, measured along the line $\Lambda$, is shorter
than the distance $\int ds$ between the two points $(u^n, v^n)$, $(u_n, v^n)$, measured
along the geodesic $v = v^n$. Hence there results a path consisting of a portion
of the geodesic $v = v_0$ and of a segment of $\Lambda$ and having the property that,
while it is a path which joins the point $(0, v_0)$ to the point $(u_n, v^n)$, it is
shorter than the portion of the geodesic $v = v^n$ which joins $(0, v^n)$ to $(u_n, v^n)$.
This contradicts ($\beta$) and therefore proves ($\alpha$).

The proof of Lemma 1 is now complete.


7. Proof of Lemma 2. In view of Section 3, it is sufficient to prove
Lemma 2 under the normalizations (13)-(15). Then, according to the proof
of Lemma 1, the function (23) is of class $C^1$ and satisfies (22₁)-(22₂).
In terms of this function, define a transformation of a neighborhood of
$(u, v) = (0, 0)$ into a neighborhood of $(r, v) = (0, 0)$ by placing

(32) $r = r(u, v)$, $v = v$.

The Jacobian of the $C^1$-mapping (32) is $1\cdot r_u - 0$, which, in view of (22₁),
is distinct from 0.

Let the covariant metric tensor of (2) be expressed in terms of the
coordinates (32). Then it is readily seen from (22₁)-(22₂) that

(33) $ds^2 = dr^2 + (EG - F^2)E^{-1}\,dv^2$

is an identity by virtue of (2) and (32). But (33) is of the desired form
(11), with $u^* = r$, $v^* = v$, and $g = G - F^2/E$, where the functions $E$, $F$, $G$
of $u$ and $v$ are thought of as expressed, by means of the inverse of the
substitution (32), as functions of $r$ and $v$.
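
The identity (33) is "readily seen" by squaring the differential of $r$. As a
worked step, using only (2) and the two relations $r_u = E^{1/2}$,
$r_v = F E^{-1/2}$ of (22₁)-(22₂):

```latex
\begin{align*}
dr &= r_u\,du + r_v\,dv = E^{1/2}\,du + F E^{-1/2}\,dv, \\
dr^2 &= E\,du^2 + 2F\,du\,dv + F^2 E^{-1}\,dv^2, \\
ds^2 - dr^2 &= (G - F^2 E^{-1})\,dv^2 = (EG - F^2)E^{-1}\,dv^2 .
\end{align*}
```

The coefficient of $dv^2$ is positive by (3), which is the positivity of $g$
required in (11).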


8. Proof of Lemma 3. In view of Section 3, it is sufficient to prove
Lemma 3 under the assumptions (12)-(15). Then (14) and the assumptions
of Lemma 3 mean that (23) is a function of class $C^1$ satisfying (22₁)-(22₂).
Section 7 shows that (32) transforms (2) into (33). But (33) is of
the form

(34) $ds^2 = dr^2 + g(r, v)\,dv^2$, where $g(0, 0) = 1$,

by (15), and it is clear from (34) that, if $r^2 + v^2$ is small enough, every
segment (12) minimizes the distance $\int ds$ between any two points of that
segment. This proves Lemma 3.


9. Normal coordinates. Suppose that a metric (2), satisfying (3), is
given in terms of normal coordinates $(u, v)$ at $(0, 0)$ (Riemann). By this
is meant that

(35) $u = r\cos\theta$, $v = r\sin\theta$

represents a geodesic for every fixed $\theta$ and that, in addition, the $r \geq 0$ occurring
in (35) is, except for a factor which depends only on $\theta$, identical with the arc
length on each of the geodesics $\theta = \text{const.}$ which issue from $(u, v) = (0, 0)$.
If the functions $E(u, v)$, $F(u, v)$, $G(u, v)$ possess partial derivatives of a
sufficiently high order, then, according to Gauss [5], pp. 249-250 (and, in the
multi-dimensional case, Riemann [11], p. 279; cf. Dedekind's comments [4],
pp. 406-407), the differential equations of the geodesics (that is, the equations
of motion

(36) $[L]_u = 0$, $[L]_v = 0$,

where the brackets denote the Lagrangian derivatives of

(37) $L(u, v; u', v') = \tfrac{1}{2}E(u, v)u'^2 + F(u, v)u'v' + \tfrac{1}{2}G(u, v)v'^2 = \tfrac{1}{2}s'^2$

and the prime denotes $d/dt$) must possess the invariant relations

(38) $L_{u'}(u, v; u, v) = L_{u'}(0, 0; u, v)$, $L_{v'}(u, v; u, v) = L_{v'}(0, 0; u, v)$

whenever $(u, v)$ is a normal coordinate system at $(0, 0)$, and, as pointed out
by Herglotz ([8], p. 216), this necessary condition for normal coordinates is
sufficient as well.

Clearly, neither the definition of coordinates which are normal coordinates
nor the pair of relations (38) (which, in view of (37), simply mean that

(39) $E(u, v)u + F(u, v)v = u$, $F(u, v)u + G(u, v)v = v$

if, without loss of generality, the normalization (15) is used) involves anything
like the italicized hypothesis, that concerning a sufficiently high degree of
smoothness of the coefficient functions of (2). Correspondingly, as a relevant
illustration of the above theory of geodesic fields in a metric which is just
continuous, it will now be shown that the criterion holds without any
differentiability assumption on the coefficient functions of (2).


(*) Suppose that $E(u, v)$, $F(u, v)$, $G(u, v)$ are continuous functions
satisfying (3) in a neighborhood of $(0, 0)$, and that they are normalized at
$(0, 0)$ by (15). Then the pair of identities (39) is necessary and sufficient
in order that the coordinates $u$, $v$ be normal at $(u, v) = (0, 0)$.


The proof of the necessity and sufficiency of (39) could be deduced from
Lemma 3 and Lemma 1, respectively. Corresponding to the circumstance that
Lemmas 1-3 and (*) deal with "geodesic parallel" and "geodesic polar"
coordinates, respectively, such a deduction of (*) would, however, lead to a
complication, since what represents (4) in (*) is the differential equation
$dv/du = v/u$, which has a singularity at $u = 0$ (corresponding to the vanishing
of the Jacobian of (35) at $r = 0$). Because of this formal complication, it
will be just as convenient to prove (*) directly.


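10. Proof of (*). A direct substitution shows that (35) transforms
(2) into

(40) $ds^2 = e(r, \theta)\,dr^2 + 2f(r, \theta)\,dr\,d\theta + g(r, \theta)\,d\theta^2$,

where, in (binary) vector and matrix notations,

(41) $\begin{pmatrix} e \\ f/r \end{pmatrix} = \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix} \begin{pmatrix} E & F \\ F & G \end{pmatrix} \begin{pmatrix} \cos\theta \\ \sin\theta \end{pmatrix}$,

while

(42) $g/r^2 = E\sin^2\theta - 2F\sin\theta\cos\theta + G\cos^2\theta$.

In order to prove the sufficiency of (39), suppose that the coordinates
$u$, $v$ satisfy (39). Then (35) shows that (41) can be written in the form

$\begin{pmatrix} e \\ f/r \end{pmatrix} = \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix} \begin{pmatrix} \cos\theta \\ \sin\theta \end{pmatrix} = \begin{pmatrix} 1 \\ 0 \end{pmatrix}$

(where $1 = \cos^2\theta + \sin^2\theta$). Since this means that $e \equiv 1$ and $f \equiv 0$, it
follows that (40) reduces to

(43) $ds^2 = dr^2 + g(r, \theta)\,d\theta^2$ $(g > 0)$.

But (43) makes it trivial that the distance $\int ds$ between the point $r = 0$ and
any point $(r_0, \theta_0)$, where $r_0 > 0$, is minimized by the path $\theta = \text{const.}$ $(= \theta_0)$,
and that $r$ is the arc length along the path. This means that the coordinates
(35) are normal at $(0, 0)$, as claimed by the second assertion of (*).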
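
The criterion (39) and the reduction of (40) to (43) can be tried out on a
familiar metric. The sketch below is an illustration, not part of the paper:
it assumes the unit sphere written in normal coordinates at a point, for which
$ds^2 = dr^2 + \sin^2 r\,d\theta^2$, and evaluates (39), (41) and (42)
numerically. The function names are hypothetical.

```python
import math

def sphere_metric(u, v):
    """E, F, G of the unit sphere in normal coordinates at the origin (assumed example).

    With s = (sin r / r)^2 and r*dr = u*du + v*dv, the polar form
    dr^2 + sin(r)^2 dtheta^2 becomes s*(du^2 + dv^2) + (1 - s)*dr^2.
    """
    r2 = u * u + v * v
    if r2 == 0.0:
        return 1.0, 0.0, 1.0  # the normalization (15)
    s = math.sin(math.sqrt(r2)) ** 2 / r2
    return (s + (1.0 - s) * u * u / r2,
            (1.0 - s) * u * v / r2,
            s + (1.0 - s) * v * v / r2)

def polar_coefficients(r, theta):
    """The coefficients e, f, g of (40), computed from (41) and (42)."""
    c, s = math.cos(theta), math.sin(theta)
    E, F, G = sphere_metric(r * c, r * s)
    e = E * c * c + 2.0 * F * c * s + G * s * s
    f = r * (-E * s * c + F * (c * c - s * s) + G * s * c)
    g = r * r * (E * s * s - 2.0 * F * s * c + G * c * c)
    return e, f, g

def residual_39(u, v):
    """Left-hand sides of (39) minus right-hand sides."""
    E, F, G = sphere_metric(u, v)
    return E * u + F * v - u, F * u + G * v - v

print(residual_39(0.3, -0.2))        # both entries should be close to zero
print(polar_coefficients(0.4, 1.1))  # e close to 1, f close to 0, g close to sin(0.4)**2
```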




In order to prove the necessity of (39), which is the first assertion of (*),
it is sufficient to show that $e \equiv 1$ and $f \equiv 0$ are identities in $(u, v)$ if the
coordinates $u$, $v$ are normal at $(0, 0)$. In fact, if the constants $e = 1$, $f = 0$
are substituted into (41), then (41) reduces to a pair of identities which,
in view of the definition (35), is precisely (39).

According to the first line of (41) and the normalization (15), the value
of $e(u, v)$ at $(u, v) = (0, 0)$ is $\cos^2\theta + \sin^2\theta = 1$. On the other hand, since
the coordinates $u$, $v$ are supposed to be normal at $(0, 0)$, the equations (35)
represent, for every fixed $\theta$, a geodesic on which the arc length, when measured
from $(0, 0)$, is of the form $s = cr$, where the positive number $c = c(\theta)$ is
independent of $r$. Hence it is clear from (40) that $e(u, v)$ is independent
of $(u, v)$. In view of $e(0, 0) = 1$, this proves that $e(u, v)$ is the constant 1.

Consequently, only the identical vanishing of $f$ remains to be proved.
But (40) shows that the identical vanishing of $f$ is equivalent to the statement
that every arc $r = \text{const.}$ is transversal to every geodesic $\theta = \text{const.}$ Suppose
if possible that this transversality fails to take place somewhere, i.e., that
there exist an $r_0 > 0$ and a $\theta = \theta_0$ for which the arc $r = r_0$ is not transversal
to the geodesic $\theta = \theta_0$ at the point $(r_0, \theta_0)$. Then, by using results of [7],
pp. 144-148, in the same way as above (Section 5), it follows that there exists
a sequence of points $(r_n, \theta_n)$ which tend, as $n \to \infty$, to $(r_0, \theta_0)$ and possess the
following property: Whenever $n$ is large enough, the point $(r_n, \theta_n)$ can be
joined to the origin $(r = 0)$ by an arc along which the length $\int ds$ is less
than $r_n$. This contradicts, however, the assumption, according to which the
arc $\theta = \theta_n$ joining the point $(r_n, \theta_n)$ to the point $r = 0$ is a geodesic arc of
length $r_n$.


11. Jacobi’s multiplier. While Lemmas 1-3 concern themselves with
the equation (8) of the transversals of the geodesics defined by (9), the
following lemma deals with (9) itself (but assumes that the function $\varphi$ of
u and v contains a parameter also).

Lemma 4. Let $\varphi = \varphi(u, v; w)$ be a function of class $C^1$ on the product
space of a sufficiently small domain $D_a$: $u^2 + v^2 < a^2$ and of an interval
$|w| < b$, and suppose that the functions E, F, G and $\varphi$ satisfy assumptions
(0) and (i)-(ii) of Lemma 1, when w is fixed. Then, for fixed w, the continuous
function

(44) $\lambda = (EG - F^2)(E + 2F\varphi + G\varphi^2)^{-3/2}\,\varphi_w$

of (u,v) represents a multiplier of (9). What is somewhat more, there exists
a function R = R(u,v;w) which is continuous on the product space of $D_a$
and $|w| < b$, is of class $C^1$ on $D_a$ and satisfies the relations

(45) $R_u = -\lambda\varphi, \quad R_v = \lambda.$

(The vanishing of the partial derivative $\varphi_w$, and therefore that of the multiplier
(44), is not excluded.)

* This is the analogue of assertion ($\alpha$), Section 4, in the proof of Lemma 1. Nothing
like assertion ($\beta$), Section 4, is involved in the present case, since the arc u = 0,
occurring in ($\beta$), now degenerates to the point r = 0.


If E, F, G, instead of being just continuous, are sufficiently smooth, then
Lemma 4 reduces to Jacobi’s theorem concerning a “last multiplier” (in case
of geodesics); cf. [10], p. 498 and [3], vol. II, p. 431.

Lemma 4 will be essential in proving, without the usual assumptions of 
differentiability, a fundamental theorem on non-euclidean geometries; cf. 
Section 12 below. 


Proof of Lemma 4. If w is fixed, then, according to assertions (5)-(6)
of Lemma 1, the function

(46) $r(u, v; w) = \int_{(0,0)}^{(u,v)} (E + 2F\varphi + G\varphi^2)^{-1/2}\,\{(E + F\varphi)\,du + (F + G\varphi)\,dv\}$

satisfies (7) on $D_a$. Since $\varphi(u, v; w)$ is supposed to be of class $C^1$ (with the
inclusion of w), the function (6) has the continuous partial derivative
$\mu_w = -(F + G\varphi)\,\mu^3\,\varphi_w$. Hence, if (46) is differentiated with respect to w,
a straightforward calculation gives

(47) $r_w(u, v; w) = \int_{(0,0)}^{(u,v)} (\lambda\,dv - \lambda\varphi\,du),$

if $\lambda$ is defined by (44). Since (47) means that (45) is satisfied by the
function

(48) $R(u, v; w) = r_w(u, v; w),$

the assertions of Lemma 4 follow.
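The “straightforward calculation” can be sketched as follows. The abbreviation $Q$ is introduced here only for convenience, and the integrand of (46) is taken to be $\mu\{(E + F\varphi)\,du + (F + G\varphi)\,dv\}$ with $\mu = Q^{-1/2}$, which is an assumption read off from the way (5) and (6) are used in Section 14.

```latex
% Q is a hypothetical abbreviation; mu = Q^{-1/2} is the function (6) as assumed above.
\begin{align*}
Q &= E + 2F\varphi + G\varphi^2, \qquad \mu = Q^{-1/2}, \qquad
\mu_w = -(F + G\varphi)\,\mu^3\varphi_w,\\
\frac{\partial}{\partial w}\bigl[\mu(E + F\varphi)\bigr]
  &= \mu^3\varphi_w\bigl[FQ - (F + G\varphi)(E + F\varphi)\bigr]
   = -(EG - F^2)\,\mu^3\varphi\,\varphi_w = -\lambda\varphi,\\
\frac{\partial}{\partial w}\bigl[\mu(F + G\varphi)\bigr]
  &= \mu^3\varphi_w\bigl[GQ - (F + G\varphi)^2\bigr]
   = (EG - F^2)\,\mu^3\varphi_w = \lambda.
\end{align*}
```

Differentiating under the integral sign therefore replaces the integrand by $-\lambda\varphi\,du + \lambda\,dv$, with $\lambda$ as in (44).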


12. Beltrami’s theorem. If all geodesic arcs of a metric (2) are segments
of straight lines in a domain of some (u,v)-plane, then the metric is
of constant curvature. This is a classical theorem of Beltrami ([1], pp. 262-280;
cf. [3], vol. III, pp. 41-47). His proof and its variants assume, however,
that the coefficient functions of (2) have a sufficiently high degree of
differentiability in (u,v). Without such an assumption, the curvature K of
(2) cannot even be defined, and the proofs need substantially stronger restrictions
of differentiability than what suffices for equating K to the differential
operator assigned by the Theorema Egregium. On the other hand, in view
of the fundamental geometrical significance of Beltrami’s result, it seems to
be essential that the theorem should be formulated and proved without any
assumption of differentiability, as follows:


(**) Suppose that the coefficients of (2) are continuous functions satisfying
(3) on a (u,v)-domain, say on $D_a$: $u^2 + v^2 < a^2$, and that every
sufficiently short segment (of a straight line, $c_1 u + c_2 v = c_3$) contained in
$D_a$ is a geodesic arc of (2). Then the (given) coefficients of (2) are analytic
functions of (u,v) on $D_a$ and the curvature K = K(u,v) of (2) is independent
of (u,v) on $D_a$.


As mentioned in Section 1, a point and a direction do not in general
determine a (unique) geodesic of a metric (2), not even if the functions E,
F, G of (u,v) are of class $C^1$ (which is not assumed in the present case).
Correspondingly, a substantial part of the proof of the general formulation
(**) of Beltrami’s theorem will consist in showing that this and similar
possibilities are excluded by the last assumption of (**). To this end, the
following lemma will be needed:


(†) Under the assumption of (**), every sufficiently short geodesic arc
of (2) is a segment (of a straight line, $c_1 u + c_2 v = c_3$) in $D_a$.

The proof of the latter assertion, (†), will depend on the steps used in
the proof of Lemma 1.


13. Proof of (†). Let $\Gamma$: (u = u(t), v = v(t)), where $0 \le t \le 1$,
be a geodesic of (2) joining a point, say $(u_0, v_0)$, of $D_a$ to another point
of $D_a$. Without loss of generality, the latter point can be assumed to be the
origin, (0,0), since a is meant to be sufficiently small. The assertion of (†)
is that the arc $\Gamma$ must be on a straight line, (35), where $\theta = \theta(u_0, v_0)$ is
constant on $\Gamma$, and $r = r(t) \ge 0$, $0 \le t \le 1$.

With reference to an interior point t* of the given parameter range
$0 \le t \le 1$ of $\Gamma$, let $\Gamma_1$, $\Gamma_2$ denote the respective portions $0 \le t \le t^*$, $t^* \le t \le 1$
of the given geodesic, $\Gamma = \Gamma_1 + \Gamma_2$, and let $\Gamma^1$, $\Gamma^2$ be the segments (of straight
lines) which join the point P* = (u(t*), v(t*)) of $\Gamma$ to its respective end points
(u(0), v(0)) = (0,0), (u(1), v(1)). Then $\Gamma^i$, where i = 1, 2, has the same
length $\int ds$ as $\Gamma_i$. For, on the one hand, the assumptions of (†), being those
of (**), imply that the segment $\Gamma^i$ minimizes the distance $\int ds$ between its
given end points and, on the other hand, the same is true of the geodesic
arc $\Gamma_i$, the end points of which are the same as those of $\Gamma^i$. Consequently,
$\Gamma_1 + \Gamma^2$ and $\Gamma^1 + \Gamma_2$ are of the same length, and join the same points, as the
geodesic $\Gamma = \Gamma_1 + \Gamma_2$, as well as the polygonal path $\Gamma^1 + \Gamma^2$, and are therefore
geodesics.

Since $\Gamma^1 + \Gamma^2$ is a geodesic containing the point P* = (u(t*), v(t*)),
an application of arguments applied in [7], pp. 144-148 (or, equivalently, in
Section 5 above) shows that $\Gamma_1 + \Gamma^2$ must have at P* a unilateral tangent
line from the “right,” and that this line is that containing the segment $\Gamma^2$.
Since the latter determines for $\Gamma_1 + \Gamma^2$ a unilateral tangent line from the
“left,” it now follows from Corollary 1 in [7], p. 145, that $\Gamma_1 + \Gamma^2$ must have
at P* a tangent, i.e., that the two unilateral tangents coincide at P*. For
reasons of symmetry, the same is true of the geodesic $\Gamma^1 + \Gamma_2$, as well as of
the geodesic $\Gamma^1 + \Gamma^2$.

Clearly, this is possible only if the two segments $\Gamma^1$, $\Gamma^2$ issuing from P*
are collinear, and if the straight line containing them represents a (bilateral)
tangent of $\Gamma_1 + \Gamma_2 = \Gamma$ at P*. Since the sum of the segments $\Gamma^1$, $\Gamma^2$ is a
segment, $\Gamma^1 + \Gamma^2$, which is a geodesic joining the points (u(0), v(0)),
(u(1), v(1)), points which are independent of the choice of P* = (u(t*), v(t*))
on $\Gamma = \Gamma_1 + \Gamma_2$, it follows that $\Gamma$ must be identical with the segment $\Gamma^1 + \Gamma^2$.
This proves assertion (†) of Section 12.


14. Field constructions for (**). Let w be any constant, and let

(49) $\varphi(u, v; w) = w$

for every (u,v) on (1). The assumptions of (**) show that assumptions (0)
and (i) of Lemma 1 are satisfied by the case (49) of (4) (for every fixed w).
On the other hand, (†) shows that assumption (ii) of Lemma 1 is satisfied.
Consequently, Lemma 1 is applicable (at every fixed value of w), as is
Lemma 4 (the function (49) is of class $C^1$ in u, v; w together).

Since the case w = 0 of (49) reduces (5) and (6) to E du + F dv and
$E^{-1/2}$ respectively, it follows from Lemma 1 that

(50) $E^{-1/2}(E\,du + F\,dv)$ is an exact differential.

On the other hand, it is seen from the case (49) of the definition (44) of $\lambda$,
and of the assertion (45) of Lemma 4, that since $\varphi_w = w_w = 1$, the Pfaffian

$(EG - F^2)(E + 2Fw + Gw^2)^{-3/2}\,(dv - w\,du)$

is an exact differential for every value of the constant w. Hence the choice
w = 0 leads to the conclusion that

(51) $(EG - F^2)E^{-3/2}$ is a function of v only.




In addition, since the assumptions of (**), from which (50) and (51) have
been concluded, remain unaltered if u, v and E, G are replaced by v, u and
G, E respectively, it is clear that, corresponding to (50) and (51),

(52) $G^{-1/2}(F\,du + G\,dv)$ is an exact differential,
and
(53) $(EG - F^2)G^{-3/2}$ is a function of u only.
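As an illustration that is not taken from the paper, relations (51) and (53) can be checked on a metric whose geodesics are known to be straight lines: the metric induced by the central (gnomonic) projection of the unit sphere onto a plane. The abbreviation $W$ below is introduced only for this sketch.

```latex
% Illustrative example (an assumption of this sketch, not from the source):
% gnomonic projection of the unit sphere, W = 1 + u^2 + v^2.
\begin{align*}
E &= \frac{1 + v^2}{W^2}, \qquad F = -\frac{uv}{W^2}, \qquad G = \frac{1 + u^2}{W^2},\\
EG - F^2 &= \frac{(1 + u^2)(1 + v^2) - u^2v^2}{W^4} = \frac{1}{W^3},\\
(EG - F^2)\,E^{-3/2} &= (1 + v^2)^{-3/2}, \qquad
(EG - F^2)\,G^{-3/2} = (1 + u^2)^{-3/2}.
\end{align*}
```

The first of the last two expressions depends on v only and the second on u only, in agreement with (51) and (53).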


Beltrami’s proof of the assertion, K = const., of his theorem falls into
two parts. He first concludes ([1], pp. 263-266) the preceding four relations,
(50)-(53), under the assumption that the functions E, F, G are sufficiently
smooth (of class $C^n$, with something like n = 2), and then, by assuming a
still higher degree of differentiability (something like n = 5), he deduces ([1],
pp. 266-270) the assertion, K = const., from (50)-(53). Correspondingly,
the completion of the proof of (**) will depend on a suitable adaptation of the
latter part of Beltrami’s proof, leading from (50)-(53) to certain functional
equations for which it turns out that their non-analytic solutions cannot be
continuous (or, for that matter, L-integrable).


15. The functional equations of (**). Let U = U(u), V = V(v)
denote, for sufficiently small |u|, |v|, the cube roots of the respective functions
(53), (51). In view of (3), these continuous functions of the respective
single variables u, v are positive. Since (53) and (51) mean that both
products $UG^{1/2}$, $VE^{1/2}$ are identical with the cube root of $EG - F^2$, it follows
that there exists a positive, continuous function $\lambda = \lambda(u, v)$ satisfying

(54) $E^{1/2} = \lambda U, \quad G^{1/2} = \lambda V,$

and that a continuous function $\mu = \mu(u, v)$ is, therefore, defined by placing

(55) $F = \lambda\mu UV$

(the multipliers (44), (6) have nothing to do with the present $\lambda$, $\mu$). In
terms of (54) and (55), the remaining two relations, (50) and (52), mean
that both Pfaffians

(56) $\lambda U\,du + \mu V\,dv, \quad \mu U\,du + \lambda V\,dv$ are exact differentials.


Since U, V are positive, continuous functions of the respective single
variables u, v, the conditions

(57) $d\alpha = U\,du + V\,dv, \quad d\beta = U\,du - V\,dv$

and $\alpha(0,0) = 0$, $\beta(0,0) = 0$ define, near (u,v) = (0,0), a pair of functions
$\alpha = \alpha(u,v)$, $\beta = \beta(u,v)$ which are of class $C^1$ and of non-vanishing Jacobian
(the latter being $\partial(\alpha, \beta)/\partial(u, v) = -2UV < 0$). Thus $(u,v) \to (\alpha,\beta)$ and
$(\alpha,\beta) \to (u,v)$ are one-to-one transformations, of class $C^1$, of corresponding
small neighborhoods of the origins of the parameter planes (u,v), $(\alpha,\beta)$.
Since U(u) and V(v) are positive and continuous, it follows that, in a
neighborhood of $(\alpha,\beta) = (0,0)$, two positive continuous functions, A and B,
of single variables are defined by placing

(58) $1/U(u) = A(\alpha + \beta), \quad 1/V(v) = B(\alpha - \beta).$

In fact, since (57) means that $d(\alpha + \beta) = 2U\,du$ and $d(\alpha - \beta) = 2V\,dv$, the
functions u, v of $(\alpha, \beta)$ depend only on $\alpha + \beta$, $\alpha - \beta$ respectively.

According to (56), both Pfaffians

$(\lambda + \mu)(U\,du + V\,dv), \quad (\lambda - \mu)(U\,du - V\,dv)$

are exact differentials. In view of (57), this means that the same is true
of both Pfaffians $(\lambda + \mu)\,d\alpha$, $(\lambda - \mu)\,d\beta$ (in which the functions $\lambda$, $\mu$ of (u, v)
are thought of as expressed in terms of the new variables, $\alpha$ and $\beta$). Consequently,
the continuous functions $\lambda + \mu$, $\lambda - \mu$ of $(\alpha, \beta)$ depend only on $\alpha$, $\beta$
respectively. Thus if 2a, 2b denote these continuous functions of the respective
single variables $\alpha$, $\beta$, then

(59) $\lambda = a(\alpha) + b(\beta), \quad \mu = a(\alpha) - b(\beta).$

Hence it is seen from the definitions of U and V, (54), (55) and (58), that

(60) $a(\alpha) + b(\beta) = 4a(\alpha)\,b(\beta)\,A(\alpha + \beta)\,B(\alpha - \beta).$

As pointed out above, $\lambda$ and A, B are positive. It follows therefore from
the first of the relations (59) and from (60) that a > 0, b > 0. In particular,
division of (60) by ab > 0 is allowed; so that

(61) $1/a(\alpha) + 1/b(\beta) = 4A(\alpha + \beta)\,B(\alpha - \beta).$
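The passage from the definitions to (60) can be spelled out as follows. This is a sketch only, and it assumes that (54) and (55) read $E^{1/2} = \lambda U$, $G^{1/2} = \lambda V$, $F = \lambda\mu UV$.

```latex
% Sketch under the stated reading of (54), (55); U, V are the cube roots of (53), (51).
\begin{align*}
U^3 = (EG - F^2)\,G^{-3/2},\ G^{1/2} = \lambda V
  &\;\Longrightarrow\; EG - F^2 = (\lambda UV)^3,\\
\text{(54), (55)}
  &\;\Longrightarrow\; EG - F^2 = \lambda^2U^2V^2(\lambda^2 - \mu^2),\\
\text{hence}\quad (\lambda + \mu)(\lambda - \mu) &= \lambda\,UV,\\
\text{and, by (58), (59),}\quad 4ab &= (a + b)\,\frac{1}{AB},
  \quad\text{i.e.}\quad a + b = 4abAB .
\end{align*}
```

Since $a + b = \lambda > 0$ and A, B > 0, the last relation gives ab > 0, which together with a + b > 0 yields a > 0 and b > 0.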


If $\alpha + \beta$ and $\alpha - \beta$ in (61) are replaced by $2\alpha$ and $2\beta$, respectively, then
an integration with respect to the new $\beta$ leads to the following identity in
$(\alpha, \beta)$:

$\int_{\alpha}^{\alpha+\beta} dt/a(t) - \int_{\alpha}^{\alpha-\beta} dt/b(t) = 4A(2\alpha)\int_{0}^{\beta} B(2t)\,dt.$

Since the sum on the left of this identity has a continuous partial derivative
with respect to $\alpha$, the same is true of the product on the right. This means
that $A(\alpha)$ has a continuous first derivative. If the rôles of the original $\alpha$ and
$\beta$ are interchanged in this deduction, it follows that $B(\beta)$ has a continuous
first derivative. Finally, a repetition of this argument shows that $A(\alpha)$ and
$B(\beta)$ have derivatives of arbitrarily high order.

Consequently, the tacit assumptions of the calculations of Beltrami ([1],
pp. 266-272), referred to at the end of Section 14 above, are satisfied by
necessity. In other words, the classical proofs of Beltrami’s theorem can now
be repeated; so that the proof of (**), Section 12, is complete.


THE JOHNS HOPKINS UNIVERSITY.

REFERENCES.

[1] E. Beltrami, Opere Matematiche, vol. 1 (1902).
[2] O. Bolza, Vorlesungen über Variationsrechnung, Leipzig und Berlin, 1909.
[3] G. Darboux, Leçons sur la théorie générale des surfaces, Paris, vol. II (1889) and vol. III (1894).
[4] R. Dedekind, see [11].
[5] C. F. Gauss, Werke, vol. 4 (1873).
[6] J. Hadamard, Leçons sur le calcul des variations, Paris, 1910.
[7] P. Hartman and A. Wintner, “On the problems of geodesics in the small,” American Journal of Mathematics, vol. 73 (1951), pp. 132-148.
[8] G. Herglotz, Zur Riemannschen Metrik, Berichte der Sächsischen Akademie der Wissenschaften zu Leipzig, vol. 73 (1921), pp. 215-225.
[9] D. Hilbert, “Ueber das Dirichlet’sche Prinzip,” Jahresbericht der Deutschen Mathematiker-Vereinigung, vol. 8 (1900), pp. 184-188.
[10] C. G. J. Jacobi, Gesammelte Werke, vol. 4 (1886).
[11] B. Riemann, Gesammelte mathematische Werke (ed. 1892).
[12] A. Wintner, “On isometric surfaces,” American Journal of Mathematics, vol. 74 (1952), pp. 194-214.




NOTE ON DOUBLE-MODULES OVER ARBITRARY RINGS.*

By TADASI NAKAYAMA.

Jacobson’s [3] module-theoretical Galois theory of non-normal extension
fields was generalized by Hochschild [2] to a theory of double-modules over
sfields. The theory was further extended to the infinite-dimensional case by
Dieudonné [1]. On the other hand, it was shown in [4] that a goodly
portion of the theory can be transferred to the case of general rings. It is
now natural to study the infinite-dimensional case also for general rings,
which we propose to do in this note. It seems to the writer that there are¹
two main features in the theory; one is the characterization of relation-modules,
and the other is the characterization of direct (Kronecker) self-products.
With respect to the latter, our generalization is rather satisfactory
(§5), while it is quite powerless with respect to the former; the same was
the case with [4]. In closing the introduction, we remark that the formulation
of the present note is left-right symmetric to the one in [4] (but is
in accord with [1], [2]).

1. Relation-modules of double-modules. Let K be a ring; by a ring
we mean in this note always one which possesses a unit element, by a subring
we mean one containing that unit element, and by a module, either left- or
right-, we mean one on which the unit element operates as an identity. Let
A be a second ring, and $\mathfrak{A}$ the additive group of all additive homomorphisms
of A into K. $\mathfrak{A}$ is an A-K-double-module with respect to the natural operations
defined by²

(1) $x(a \cdot \alpha) = (xa)\alpha, \quad x(\alpha \cdot k) = (x\alpha)k \qquad (x, a \in A,\ k \in K,\ \alpha \in \mathfrak{A}).$

Let M be a K-A-double-module, and let M* be the module of all K-homomorphisms
of M into K, i.e. the dual module of the K-(left-)module M.
It is an A-K-double-module according to a definition similar to the above.

* Received May 31, 1951.

¹ Besides the duality between certain subrings (over which the whole ring has
independent (right-) bases) and certain relation-modules.

² The operations of the elements of A, K on $\mathfrak{A}$ are indicated by dots.


(In fact, the above construction depends only on the fact that A is an A-right-module
and K is a K-right-module.) Let $u_0$ be an element of M.
With $\sigma \in M^*$ we consider the element $\bar\sigma$ of $\mathfrak{A}$ defined by

(2) $x\bar\sigma = (u_0 x)\sigma \qquad (x \in A).$

Denote the totality of $\bar\sigma$ ($\sigma$ running over M*), by $R(M, u_0)$. It is an A-K-submodule
of $\mathfrak{A}$ as is readily seen, and we call it the relation-module of $u_0$
in M. We have evidently

Lemma 1. If M is contained in a K-A-double-module $M_1$, as K-A-module,
such that every element of M* can be extended to an element of $M_1^*$, then
$R(M_1, u_0) = R(M, u_0)$.


Let N be a second K-A-module, and $v_0$ an element of N. Let $\varphi$ be a
(K-A-)homomorphism of M into N. It induces in a natural manner an
(A-K-)homomorphism $\varphi^*$: $\tau \to \sigma = \varphi\tau$ of N* into M*. Suppose now $u_0\varphi = v_0$.
Let $\bar\tau$ in $R(N, v_0)$ be given by $x\bar\tau = (v_0 x)\tau$. Then

(3) $x\bar\tau = (v_0 x)\tau = ((u_0\varphi)x)\tau = ((u_0 x)\varphi)\tau = (u_0 x)\sigma = x\bar\sigma.$

Thus we have the following proposition, whose latter half includes Lemma 1.

Proposition 1. If there exists a (K-A-)homomorphic mapping $\varphi$ of M
into N which maps $u_0$ into $v_0$, then

(4) $R(M, u_0) \supseteq R(N, v_0).$

If $\varphi^*$ maps N* onto M* then $R(M, u_0) = R(N, v_0)$.

We can immediately verify

Proposition 2. If $u_0 \in M$, $v_0 \in N$, then

(5) $R(M \oplus N, w_0) = R(M, u_0) + R(N, v_0)$

with $w_0 = u_0 + v_0$ in $M \oplus N$.


Let S be a third ring, and let N be an S-K-double-module (contrary to
the above). The direct product $N \times M = N \times_K M$ of N, M over K is
defined as usual, and is an S-A-double-module. With $\sigma \in M^*$, $\tau \in N^*$ (where
N* is the dual module of the S-module N), we set

(6) $(v \times u)(\sigma \times \tau) = (v(u\sigma))\tau,$

and observe that this essentially defines $\sigma \times \tau$ as an element of $(N \times M)^*$,
independent of the special expression of $v \times u$. Let $\bar\sigma$, $\bar\tau$ be the elements of
$R(M, u_0)$, $R(N, v_0)$ ($v_0$ being an element of N), which correspond to $\sigma$, $\tau$,
respectively. We have

$(x\bar\sigma)\bar\tau = ((u_0 x)\sigma)\bar\tau = (v_0((u_0 x)\sigma))\tau = (v_0 \times (u_0 x))(\sigma \times \tau) = ((v_0 \times u_0)x)(\sigma \times \tau).$

Hence $x \to (x\bar\sigma)\bar\tau$ is the element of $R(N \times M, v_0 \times u_0)$ corresponding to
$\sigma \times \tau \in (N \times M)^*$, and we have

Proposition 3. For the product $N \times M$,

(7) $R(M, u_0)R(N, v_0) \subseteq R(N \times M, v_0 \times u_0).$
Furthermore we have

Proposition 4. Let M be a K-A-double-module generated, as K-A-module,
by a single element $u_0$. Let N be a second K-A-double-module such
that for every non-zero element v of N, there exists at least one element $\tau$
of N* satisfying $v\tau \ne 0$. Then the inclusion (4) implies, conversely, the
existence of a (unique) homomorphism $\varphi$ of M into N, such that $u_0\varphi = v_0$.

For a proof we observe, after Hochschild [2], that if a certain sum
$\sum k u_0 a$ ($k \in K$, $a \in A$) vanishes, then $\sum k((u_0 a)\sigma) = (\sum k u_0 a)\sigma = 0$.
If (4) is the case, then this implies $\sum k((v_0 a)\tau) = 0$, or $(\sum k v_0 a)\tau = 0$, for all
$\tau \in N^*$. This implies in turn $\sum k v_0 a = 0$, according to our assumption on N*.
Thus $u_0 \to v_0$ defines a (K-A-)homomorphism of M into N.


2. Restricted relation-modules. So far, no particular assumptions have
been made on M (nor on K, A). Let us now assume that M possesses an
independent K-basis, say $\{u_h\}$. Let $\sigma_h$ be the element of M* such that

(8) $u_i\sigma_h = \delta_{ih}$ (Kronecker $\delta$’s);

$u\sigma_h$ is nothing but the coefficient of $u_h$ in the K-linear expression of u by $\{u_h\}$.
The K-combinations of $\sigma_h$’s (with varying h) form a K-submodule $M^{\#}$ of M*,
which is independent of the particular choice of the basis $\{u_h\}$. Letting $\sigma$
run only over $M^{\#}$, we then obtain a submodule $R^{\#}(M, u_0)$ of $R(M, u_0)$, which
we shall call the restricted relation-module of $u_0$ in M. If $\bar\sigma_h$ is obtained
from $\sigma_h$, then $\{\bar\sigma_h\}$ forms a (not necessarily independent) K-basis of $R^{\#}(M, u_0)$.
Unlike $R(M, u_0)$, the restricted relation-module $R^{\#}(M, u_0)$ is not A-left-allowable
in general.
Similarly to Lemma 1 we have

Lemma 2. If M is contained in a K-A-double-module $M_1$ as K-A-module,
and $M_1$ has an independent K-basis which contains an independent K-basis
of M, then $R^{\#}(M_1, u_0) = R^{\#}(M, u_0)$.

Next let M be homomorphically mapped into N by $\varphi$, and $u_0\varphi = v_0$.
Suppose also N has an independent K-basis, say $\{v_j\}$. Expressing each $u_h\varphi$
by $\{v_j\}$, we see that for $\tau \in N^{\#}$, the element $\varphi\tau$ of M* is in $M^{\#}$. It follows
readily that $R^{\#}(N, v_0) \subseteq R^{\#}(M, u_0)$. That Proposition 2 can be transferred
to restricted relation-modules is trivial. Thus

Proposition 5. Our Propositions 1, 2 hold also for the restricted
relation-modules $R^{\#}$ (instead of R), provided that both M, N possess independent
K-bases.


As for Proposition 3, we observe that if $\{u_h\}$, $\{v_j\}$ are, respectively, an
independent K-basis of M and an independent S-basis of N, then $\{v_j \times u_h\}$ is
an independent S-basis of $N \times M = N \times_K M$. Also

(9) $(v_j \times u_h)(\sigma_i \times \tau_k) = (v_j(u_h\sigma_i))\tau_k = \delta_{hi}\delta_{jk}.$

On the other hand, the element of $R^{\#}(N \times M, v_0 \times u_0)$ corresponding to
$\sigma_i \times \tau_k$ is just the product $\bar\sigma_i\bar\tau_k$, as was seen formerly. Thus we have

Proposition 6. Let M, N be K-A-, and S-K-double-modules, respectively,
possessing independent K-, S-bases. Then

(10) $R^{\#}(M, u_0)R^{\#}(N, v_0) \supseteq R^{\#}(N \times M, v_0 \times u_0).$

We have furthermore, corresponding to Proposition 4,

Proposition 7. Let M, N be K-A-double-modules possessing independent
K-bases. If M is generated by $u_0$ as a K-A-module, and if $R^{\#}(M, u_0)
\supseteq R^{\#}(N, v_0)$, then $u_0 \to v_0$ gives a (K-A-)homomorphism of M into N.


Let us next consider a K-A-double-module M which (not only is generated
by $u_0$ and possesses an independent K-basis, but) possesses an independent
K-basis consisting of elements contained in $u_0A$; then we say that M is a
special K-A-double-module, and $u_0$ is a generator. We prove

Proposition 8. If M is a special K-A-double-module with generator $u_0$,
and $\{u_h\}$ is an arbitrary independent K-basis of M (not necessarily contained
in $u_0A$), then $\{\bar\sigma_h\}$ (with $\sigma_h$ as in (8)) forms an independent K-(right-)basis
of $R^{\#}(M, u_0)$.

It suffices to consider the case where $\{u_h\}$ is contained in $u_0A$. Let

(11) $u_h = u_0 t_h \qquad (t_h \in A).$

Then

(12) $t_i\bar\sigma_h = \delta_{hi}.$

It follows immediately that the $\bar\sigma_h$’s are independent. That they form a K-basis
of $R^{\#}(M, u_0)$ has been seen before.


3. Direct self-products of rings. We now consider the case K = A.
Then $\mathfrak{A}$ is the absolute module-endomorphism ring of A. Let S be a subring
of A. The direct self-product $M = A \times_S A$ of A over S is an A-double-module
(i.e. A-A-double-module), and we have

Proposition 9. The relation-module $R(M, 1 \times 1)$ of $M = A \times_S A$ with
respect to $1 \times 1$, is the S-left-endomorphism ring of A (i.e. the commuter in
$\mathfrak{A}$ of the left-multiplication ring $S_L$ of S on A), denoted by $\mathfrak{V}(A, S_L)$.

For, with $\bar\sigma \in R(M, 1 \times 1)$, where $\sigma \in M^*$, we have $x\bar\sigma = ((1 \times 1)x)\sigma
= (1 \times x)\sigma$. For $s \in S$ we have

$(sx)\bar\sigma = (1 \times sx)\sigma = (s \times x)\sigma = (s(1 \times x))\sigma = s((1 \times x)\sigma) = s(x\bar\sigma),$

which shows that $\bar\sigma \in \mathfrak{V}(A, S_L)$. Conversely, if $\alpha \in \mathfrak{V}(A, S_L)$, we put $(x \times y)\sigma
= x(y\alpha)$, and observe that this defines $\sigma$ uniquely as an element of M*, since
$(xs \times y)\sigma = (xs)(y\alpha) = x((sy)\alpha) = (x \times sy)\sigma$. Clearly $\bar\sigma = \alpha$.
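A small illustration of Proposition 9, which is not taken from the paper, may help to fix the conventions. The choice $A = \mathbb{C}$, $S = \mathbb{R}$ below is an assumption made only for this sketch; mappings are written on the right, as in the text.

```latex
% Illustrative choice (assumption of this sketch): A = C, S = R, M = C x_R C.
\begin{align*}
&\text{Every } \mathbb{R}\text{-linear map } \alpha \text{ of } \mathbb{C} \text{ into itself has the form }
  x\alpha = xa + \bar{x}\,b \quad (a, b \in \mathbb{C}).\\
&\text{Put } (x \times y)\sigma = x(y\alpha) = x(ya + \bar{y}\,b). \text{ For real } s:\\
&\qquad (xs \times y)\sigma = xs(ya + \bar{y}\,b) = x\bigl((sy)a + \overline{sy}\,b\bigr) = (x \times sy)\sigma,\\
&\text{so } \sigma \text{ is well defined on } \mathbb{C} \times_{\mathbb{R}} \mathbb{C}, \text{ and }
  x\bar\sigma = (1 \times x)\sigma = x\alpha .
\end{align*}
```

In this case M is a free left $\mathbb{C}$-module on the two elements $1 \times 1$, $1 \times i$, so M* has rank 2 over $\mathbb{C}$, which matches the fact that the $\mathbb{R}$-linear maps of $\mathbb{C}$ form a space of dimension 4 over $\mathbb{R}$.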

Now let M be an arbitrary A-double-module and $u_0$ an element of M.
Then the set of all x satisfying $xu_0 = u_0x$ forms a subring of A, and we take
it as

(13) $S = \{x \in A \mid xu_0 = u_0x\}.$

Then we have $R(M, u_0) \subseteq \mathfrak{V}(A, S_L)$; the proof is similar to the first half
of the above proof of Proposition 9 (which depended only on $(1 \times 1)s
= s(1 \times 1)$). In other words, if $s \in S$, then the left-multiplication $s_L$ (on A)
commutes with every element of $R(M, u_0)$. The converse is also true if M
is such that for every non-zero element u in M, there exists a $\sigma$ in M* satisfying
$u\sigma \ne 0$.

Now we wish to know when M is isomorphic to $A \times_S A$. To do so, let us
assume that M is special, and $u_0$ is its generator. Let $\{u_h\}$ be an independent
A-left-basis of M contained in $u_0A$, and put $u_h = u_0t_h$ (i.e. (11)). We have
then (12). Let N be a second (special) A-double-module which is isomorphic
to M. Let $v_0$, $v_h$ be its elements corresponding to $u_0$, $u_h$. Suppose that
$u_0 \to v_0 \times u_0$ gives an (A-two-sided) homomorphism of M into $N \times M$.
By the homomorphism, $u_0x$ is mapped on

$(v_0 \times u_0)x = \sum_h v_0 \times (x\bar\sigma_h)u_h = \sum_{h,i} ((x\bar\sigma_h)\bar\sigma_i)(v_i \times u_h).$

On the other hand, $u_0x = \sum_h (x\bar\sigma_h)u_h = \sum_h (x\bar\sigma_h)u_0t_h$,
and this is mapped on

$\sum_h (x\bar\sigma_h)(v_0 \times u_0)t_h = \sum_h (x\bar\sigma_h)(v_0 \times u_h) = \sum_{h,i} (x\bar\sigma_h)(1\bar\sigma_i)(v_i \times u_h).$

Hence

(14) $(x\bar\sigma_h)\bar\sigma_i = (x\bar\sigma_h)(1\bar\sigma_i).$

So,

$u_0(x\bar\sigma_h) = \sum_i ((x\bar\sigma_h)\bar\sigma_i)u_i = \sum_i (x\bar\sigma_h)(1\bar\sigma_i)u_i = (x\bar\sigma_h)u_0.$

Hence $x\bar\sigma_h \in S$ (for every $x \in A$ and every h). Also

$u_0\sum_h (x\bar\sigma_h)t_h = \sum_h (x\bar\sigma_h)u_0t_h = \sum_h (x\bar\sigma_h)u_h = u_0x.$

If $\mathfrak{r}$ denotes the right-ideal $\{x \in A \mid u_0x = 0\}$ of A, then

(15) $x \equiv \sum_h (x\bar\sigma_h)t_h \pmod{\mathfrak{r}}.$

It is clear, because of (12), that the $t_h$ are S-left-independent modulo $\mathfrak{r}$.
Hence $\{t_h\}$ forms an independent left S-basis of A mod. $\mathfrak{r}$. Suppose $\mathfrak{r}$ is 0.
Then $x = \sum_h (x\bar\sigma_h)t_h$, and $\{t_h\}$ forms an independent left S-basis of A. It is
then easy to verify, observing (12), (15) particularly, that M is (A-two-sided)
isomorphic to $A \times_S A$, by the correspondence $\{u_h\} \to \{1 \times t_h\}$ over A.

Our assumption was that $u_0 \to v_0 \times u_0$ gives an (A-)homomorphism of M
into $N \times M$. This, however, can be secured either by assuming $R(N \times M,
v_0 \times u_0) \subseteq R(M, u_0)$, or by assuming $R^{\#}(N \times M, v_0 \times u_0) \subseteq R^{\#}(M, u_0)$, by
Propositions 4, 7. This proves the second half of the following Proposition,
whose first half is readily seen to be true.

Proposition 10. Let M be a special A-double-module with generator $u_0$.
Let N be a second special A-double-module which is isomorphic to M, and $v_0$
its generator corresponding to $u_0$. M is isomorphic to $A \times_S A$ for some subring
S of A, with $u_0$ corresponding to $1 \times 1$, if and only if

$R(N \times M, v_0 \times u_0) \subseteq R(M, u_0)$ (or $R^{\#}(N \times M, v_0 \times u_0) \subseteq R^{\#}(M, u_0)$)

and $u_0x = 0$ ($x \in A$) implies x = 0.


4. Topologies in M*, $\mathfrak{A}$. Coming back to the general case where K
and A are perhaps different, we consider K, A, and M all in their discrete
topologies; the particular concern is in K. We then introduce in $\mathfrak{A}$ the weak
topology, in which a neighborhood of 0 is the set of elements vanishing at a
finite subset of A. We consider M* also in its weak topology. The mapping
$\sigma \to \bar\sigma$ of M* onto $R(M, u_0)$ is then continuous, since $x_i\bar\sigma = 0$ (i = 1, 2, ..., m)
are implied by $(u_0x_i)\sigma = 0$ (i = 1, 2, ..., m). Provided that $u_0$ generates
M, as K-A-module (i.e. $M = Ku_0A$), the mapping is open too, as we see
readily; hence it is a homeomorphism, since it is 1-1 under the assumption.
Further, $R(M, u_0)$ is closed in $\mathfrak{A}$, whenever $M = Ku_0A$ (or, more generally,
when every element of $(Ku_0A)^*$ can be extended to an element of M*).
To prove this, let $\alpha$ be an element of the closure of $R(M, u_0)$ in $\mathfrak{A}$. We
then put

$\left(\sum_{i=1}^{m} k_iu_0x_i\right)\sigma = \sum_{i=1}^{m} k_i(x_i\alpha),$

where m is an arbitrary natural number, and $k_i$, $x_i$ are arbitrary elements of
K, A. This gives a unique definition of $\sigma$ as a mapping of M into K. For,
with different expressions of an element of M as sums of elements of the
form $ku_0x$, the right-hand sides are equal, since the same is the case with
an arbitrary element of $R(M, u_0)$ in place of $\alpha$. $\sigma$ is clearly K-linear, and $\sigma \in M^*$.
Furthermore, $\alpha$ is equal to the $\bar\sigma$ given by this $\sigma$, which proves our assertion.

Assume that M possesses an independent K-basis. Then ($M^{\#}$ is dense
in M* whence) $R^{\#}(M, u_0)$ is dense in $R(M, u_0)$. Propositions 3, 6, combined,
give

Proposition 11. Under the same assumption as in Proposition 6,
$R(N \times M, v_0 \times u_0)$ is the closure of
$R(M, u_0)R(N, v_0)$ (or, of $R^{\#}(M, u_0)R^{\#}(N, v_0)$)
in 


5. The main theorem. Consider again the case K = A, and consider a
special A-double-module M with generator $u_0$. Suppose $R(M, u_0)$ is a ring.
Then Proposition 11 implies that the condition $R(N \times M, v_0 \times u_0) \subseteq R(M, u_0)$
in Proposition 10 is fulfilled. Thus we have

THEOREM. A special A-double-module M is isomorphic to $A \times_S A$, with
some subring S of A, if and only if the relation-module $R(M, u_0)$ forms a
ring and $u_0x = 0$ implies x = 0.


6. Characterization of relation-modules. So far our theory has been
fairly smooth. In particular, our Theorem generalizes the similar theorem
in the sfield case. Coming back to the case $K \ne A$ and seeking for a
characterization of relation-modules, we encounter difficulties (cf. the examples
below), which do not prevail in the sfield case. However, before we give up,
let us prove

Lemma 3. Suppose a K-right-submodule $\mathfrak{M}$ of (the A-K-module) $\mathfrak{A}$ has
an independent K-basis $\{\alpha_h\}$ such that for each $x \in A$, almost all $x\alpha_h$ are 0.
Suppose further that $\mathfrak{M}$ is dense in $A \cdot \mathfrak{M}$, and in fact that for each $a \in A$ and
for a finite number of h, say 1, 2, ..., m, almost all $a \cdot \alpha_i$ are contained in
the closure of the K-right-module spanned by $\{\alpha_j\ (j \ne 1, 2, \ldots, m)\}$. Then
$\mathfrak{M}$ is the restricted relation-module of a certain K-A-double-module.


To prove the lemma, let $\mathfrak{M}^*$ be the dual-module of the K-(right-)module
$\mathfrak{M}$, i.e. the K-left-module of all continuous K-homomorphisms of $\mathfrak{M}$ into K.
Since $\mathfrak{M}$ is dense in $A \cdot \mathfrak{M}$, every element of $\mathfrak{M}^*$ can be considered as a
continuous K-homomorphism of $A \cdot \mathfrak{M}$ into K, and $\mathfrak{M}^*$ is essentially the dual-module
of $A \cdot \mathfrak{M}$. Hence $\mathfrak{M}^*$ is a K-A-double-module. Let $u_h$ be the elements
of $\mathfrak{M}^*$ such that $u_h\alpha_i = \delta_{hi}$, and let M be the K-(left-)submodule of $\mathfrak{M}^*$
spanned by these $u_h$. It is the totality of elements u of $\mathfrak{M}^*$ such that almost
all $u\alpha_i$ are 0 (for each u). Let $u_0$ be the element of $\mathfrak{M}^*$ defined by $u_0\alpha = 1\alpha$
($\alpha \in \mathfrak{M}$), 1 being the unit element of A. Because of our assumption, $u_0\alpha_h = 0$
for almost all h. Hence $u_0 \in M$. Further, M is A-right-allowable. To prove
this, let $u \in M$, $a \in A$. Define ua by $(ua)\alpha = u(a \cdot \alpha)$. Let {1, 2, ..., m} be
the totality of indices h such that $u\alpha_h \ne 0$. Almost all $a \cdot \alpha_i$ are in the closure
of the K-module spanned by $\{\alpha_j\ (j \ne 1, 2, \ldots, m)\}$. Hence almost all $(ua)\alpha_i$
($= u(a \cdot \alpha_i)$) are 0, which proves $ua \in M$. It is now easy to see that $\mathfrak{M}$ can
be considered as the restricted relation-module of this K-A-module M; observe

(16) $(u_0x)\alpha = u_0(x \cdot \alpha) \qquad (\alpha \in \mathfrak{M}).$


Proposition 12. In order that a K-submodule $\mathfrak{M}$ of $\mathfrak{A}$ be a restricted
relation-module of a special K-A-double-module, it is (necessary and) sufficient
that $\mathfrak{M}$ be dense in $A \cdot \mathfrak{M}$ and possess an (independent) K-basis $\{\alpha_h\}$
such that for each $a \in A$, almost all $a\alpha_h$ are 0, and there exist $t_h$ in A
satisfying $t_h\alpha_i = \delta_{hi}$.

To prove this, we consider an arbitrary element a of A, and any finite
number of h, say 1, 2, ..., m. We want to show that almost all $a \cdot \alpha_i$ are in
the closure of the K-module spanned by $\{\alpha_j\ (j \ne 1, 2, \ldots, m)\}$. Each $a \cdot \alpha_i$
is anyway in the closure of $\mathfrak{M}$, by assumption. As is easily seen, it is an
infinite sum $\sum_h \alpha_hk_h^{(i)}$ ($k_h^{(i)} \in K$) in the sense of convergent sum in the topology
of $\mathfrak{M}$. Thus $t_h(a \cdot \alpha_i) = k_h^{(i)}$. On the other hand, $t_h(a \cdot \alpha_i) = (t_ha)\alpha_i$, and
with a given h this is 0 except for a finite number of i, which proves our
assertion. Thus $\mathfrak{M}$ is the restricted relation-module of the module M constructed
in the proof of Lemma 3. Also $(u_0t_i)\alpha_h = t_i\alpha_h = \delta_{ih}$. Hence $u_0t_i = u_i$.
It is now clear that M is a special K-A-double-module with generator $u_0$.
(The necessity assertion in the proposition is evident.)




Further, we see readily 


Proposition 13. In order that an A-K-submodule of 𝔄 be a relation-module
of a special K-A-double-module, it is necessary and sufficient that it be
closed in 𝔄 and possess a dense subset {α_h} such that for every a ∈ A, almost
all aα_h are 0, and there exist elements t_h in A satisfying t_h α_i = δ_hi.


Remark. The condition of the existence of t_h in our Propositions is very
strong, and in that sense our criteria of relation-modules are very poor. If,
on the other hand, K is a sfield, then the automatic existence of α_h and t_h
can be proved at least for K-finite modules (and also for K-infinite modules
under a suitable topological condition) ([1], [2]; cf. also §6 below), and
thus we obtain a nice theorem in that case. However, such is certainly not
the case in general, and our assumption seems indispensable,³ as the following
examples show:


Example 1. Let F be an arbitrary field and K the simple algebra of 
all 3-dimensional matrices over F. We take A to be identical with K. 


100 
Let e; = (: 0 0) Let a, be the identity mapping of K (— A), and a, the 
000 

mapping x → e_1 x (x ∈ K). It is easy to see that α_1, α_2 are K-right-independent.
The module α_1K ⊕ α_2K (⊂ 𝔄) even forms a ring; the ring property is naturally
of interest in connection with our theorem (in §5). In spite of these nice
properties of having an independent finite K-basis and being a ring, our
module α_1K ⊕ α_2K does not possess the property required in Proposition 12
(or Proposition 13),⁴ as we see without difficulty; the argument is similar to
that in the next example.


Example 2. We now give an example in which K is a (non-commutative) 
integrity-domain. Let K be the integral domain in a p-adic division-algebra 
which is of degree 2 over its center. Let x be a primitive element of p such 
that +? belongs to the center. We again take A=K. Let «, be the identity 
mapping of K (=A), and a, the mapping (xe K). Again a, 
are right-independent over K, and a,K @a.K is a ring. Nevertheless, this 
module does not fulfill the requirement of Proposition 12 (or Proposition 13). 
(For, if there were B2e and t,,t.¢ K with t,8; = then the 
matrix would be regular. But Ge and this can 
not possess an inverse (in the 2-dimensional matrix ring over K)). 


³ It is very desirable, however, to find a simpler substitute.
⁴ Or, what amounts to the same, K is not a regular module of α_1K ⊕ α_2K in the
sense of [4].






7. Remarks on the case where K is a sfield. If K is in particular a
sfield, then naturally every K-left-module M has an independent K-basis. 
Moreover, if N is a K-submodule of M, then every element of N* can be 
extended to an element of M*. Hence, as Lemmas 1, 2 (in §1, 2) show, we 
may restrict ourselves, without loss in generality, to K-A-double-modules M 
which are generated by u_0 (i.e. cyclic modules in the sense of [1], [2]).
Further, such K-A-modules are all special with generator u_0 (in our sense).
Thus, except for the criteria (Propositions 12, 13) of relation-modules, our 
results seem to offer satisfactory generalizations of corresponding theorems 
in the sfield case. As for Propositions 12, 13, they do not include the perfect 
characterization of relation-modules given in [2], Theorem 5.3 (finite-dimen- 
sional case) and [1], Theorem 1 (infinite-dimensional case) ; the pathological 
situation in the general case being exhibited in our examples in §5 (they 
show that the immediate extension of [2], Theorem 5.3 (or [1], Theorem 1) 
cannot be true). It is thus necessary to clarify the relationship between 
Proposition 13, for instance, and [1], Theorem 1. In fact, we prove 


Lemma 4. If K is a sfield and if an A-K-submodule of A is linearly
compact (over K) then it has the property of Proposition 13.


To show this, we have actually to borrow the argument of [1], Theorem 1. 
Let M be the K-A-double-module consisting of all continuous K-homomorphisms
of our module into K. Let u_0 be the element of M which sends every
element of our module to its value at 1 (∈ A). Then M = Ku_0A, as was
shown in [1]. Let {u_h} be an independent K-basis of M contained in u_0A,
and let u_h = u_0 t_h. Let M* be the dual module of the K-left-module M. It is
essentially our given module.⁵ (The dual of the dual of a linearly compact
module is, essentially, the module itself.) In fact, if we associate with each o
in M* the element @ (as defined in § 1) of YM, then the totality of o’s is exactly 
our original submodule of &. Now, let o, and co, be as in §3. Then {oy} 
is clearly dense in M*, and therefore {,} is dense in our module. But 
tion = 8. That almost all ac, are 0 for each a, is clear from the fact that 
as, is simply the coefficient of uw, in the K-linear expression of ua by {un}. 

(This lemma explains the relationship between Proposition 13 and [1], 
Theorem 1. However, since the essential part of the latter is used in the 
proof of our lemma, we do not claim, in the least, that the latter is contained 
in the former, as we want to repeat. (Moreover, in our proof of the lemma 


5 However, we prefer not to identify the two. 




we actually proved that our module is a relation-module, and after having 
done so, Proposition 13 is rather superfluous.) In turn, Theorem 1 of [1] 
depends essentially on (the properties of linearly compact spaces, such as 
having discrete duals and being the duals of their duals, and) [2], Lemma 2. 1 
(i.e. [1], Lemma 1), which is also essentially the theorem of dual bases of 
(finite) modules over a sfield). 


UNIVERSITY OF ILLINOIS. 


REFERENCES. 


[1] J. Dieudonné, “Linearly compact spaces and double vector spaces over sfields,”
American Journal of Mathematics, vol. 73 (1951), pp. 13-19.

[2] G. Hochschild, “Double vector spaces over division rings,” American Journal of
Mathematics, vol. 71 (1949), pp. 443-460.

[3] N. Jacobson, “An extension of Galois theory to non-normal and non-separable fields,”
American Journal of Mathematics, vol. 69 (1947), pp. 27-36.

[4] T. Nakayama, “Non-normal Galois theory for non-commutative and non-semisimple
rings,” Canadian Journal of Mathematics (1951), pp. 208-218.


ORDER AND TOPOLOGY IN PROJECTIVE PLANES.* 


By OSWALD WYLER.


1. Introduction. In a finite-dimensional projective geometry with a 
topological coordinate field or skew-field, the topology of the coordinate field 
induces in a natural way a topology of the geometry, in which the line is 
closed and homeomorphic to the coordinate field with an additional element 
at infinity. This leads to the question of whether a topology of a nondesarguesian
plane can be deduced in a similar manner from a topology of its lines,
and whether two such planes with homeomorphic lines will be homeomorphic. 

In the most general case, this seems to be impossible without complicated 
additional assumptions. There are, however, two important exceptions to this 
statement, ordered projective planes, and planes with coordinates in an alter- 
native division ring. In both cases the answer to our question is affirmative. 

The first case is discussed in this paper. No additional assumption is 
needed beyond the axioms of incidence and of order. 

Ordered projective planes have been studied by many authors, chiefly in 
axiomatic treatments of real projective geometry. In this paper, a topology 
of an ordered projective plane is deduced from the interval topology on its 
lines, and the main properties of this topology of the plane are investigated. 
The Theorem of Desargues is never needed. 

In this topology, the projective operations of joining two points by a 
line and of intersecting two lines in a point are continuous, and two planes 
with homeomorphic intervals are homeomorphic. This completes the answer 
to the question asked above. The homeomorphism between the intervals is 
assumed to map endpoints on endpoints. An order-preserving mapping of an 
interval onto another is such a homeomorphism. We do not have to assume, 
however, that our homeomorphism is order-preserving. 

An example of a nondesarguesian ordered plane, given in section 9, shows 
that the axioms of order, even with the Dedekind continuity axiom added, still 
make it necessary to assume a configuration theorem in order to get coordinates. 

The decomposition of an ordered plane into three convex quadrangles 


* Received June 22, 1951. 
¹ Cf. e.g. [1], [6], [7]. Numbers in brackets refer to the references at the end
of the paper.


with the same vertices, studied in section 6, is constantly used in the next two 
sections.² It is much more useful for our purpose than the usual decomposition
of a plane into four convex triangles with the same vertices. 

Points are denoted by lower case letters, lines by small gothic letters. The 
line joining the points a and b is denoted by ab, the point of intersection of
the lines l and m by l ∩ m.


2. The axioms of separation. An ordered projective plane is charac- 
terized by the axioms of incidence and by a relation of separation between 
pairs of points, satisfying the axioms stated below. We write ab || cd if two
points a and b separate two points c and d. The following axioms of
separation are assumed.


S.1. If ab || cd, then a, b, c, d are four different collinear points. 


S.2. If a, b, c, d are four different collinear points, then at least one of
the relations ab || cd, ac || bd, bc || ad holds.

S.3. If ab || cd, then ab || dc.

S.4. If ab || cd and bc || de, then cd || ea.

S.5. If ab || cd, and if a, b, c, d are mapped on a′, b′, c′, d′ by a
perspectivity, then a′b′ || c′d′.

ab || cd always implies cd || ab by S.1 and S. 5, since abcd and cdab are 
projective for any four different collinear points. This with S.3 shows that
separation is a symmetric relation between unordered pairs of points. 

At most one of the relations in S. 2 can hold at the same time, for other- 
wise S. 4 would lead to a contradiction to S. 1. 
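
As an illustration only (it is not part of the original argument), the separation relation can be tried out on a small model of the real projective line. The sketch below assumes that points are distinct integers read as positions on a circle, so that ab || cd means exactly one of c, d lies strictly between a and b; the function names are hypothetical. It checks S.2 together with the remark that at most one relation holds, S.3 with the derived symmetry, and S.4. Axiom S.5 is not modelled, since no perspectivities occur.

```python
from itertools import permutations, product

def separates(a, b, c, d):
    """Return True if the pair {a, b} separates the pair {c, d}.

    Hypothetical model: points are distinct integers read as positions
    on a circle, so ab || cd means exactly one of c, d lies strictly
    between a and b.
    """
    if len({a, b, c, d}) < 4:        # S.1: four different points are required
        return False
    lo, hi = min(a, b), max(a, b)
    return (lo < c < hi) != (lo < d < hi)

POINTS = range(6)

def check_s2_exactly_one():
    # S.2, plus the remark that at most one of the three relations holds.
    return all(
        [separates(a, b, c, d),
         separates(a, c, b, d),
         separates(b, c, a, d)].count(True) == 1
        for a, b, c, d in permutations(POINTS, 4)
    )

def check_s3_and_symmetry():
    # S.3 (ab || cd implies ab || dc) and the derived symmetry cd || ab.
    return all(
        separates(a, b, c, d) == separates(a, b, d, c) == separates(c, d, a, b)
        for a, b, c, d in permutations(POINTS, 4)
    )

def check_s4():
    # S.4: ab || cd and bc || de imply cd || ea.
    return all(
        separates(c, d, e, a)
        for a, b, c, d, e in product(POINTS, repeat=5)
        if separates(a, b, c, d) and separates(b, c, d, e)
    )
```

A finite model of this kind can only test the axioms on a few points; it says nothing about planes that are not coordinatized by an ordered field.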


If a line contains only three points, ab || cd never holds. If we assume 
that there is a line with four different points, then S. 2 and S. 5 imply 


S.6. If a, b, c are three different collinear points, then there is a point
d such that ab || cd. 


Applying S.6 repeatedly together with the other axioms, we see that an
ordered projective plane cannot be finite.

Our axioms of separation are essentially those of [1], p. 22. Axiom S. 4
has been put into a more suggestive form.


² Cf. [2], p. 421, where this decomposition is denoted by {4,3}/2 (for the elliptic
plane). The author is indebted to the referee for this reference. 




3. Segments and intervals. If a, b, c are three different points on a
line g, we denote by (ab)_c the set of all points x with ab || cx. A set of this
form is called a segment on g. The points a and b are called the endpoints
of the segment (ab)_c. The set consisting of the segment (ab)_c and its endpoints
a and b is denoted by [ab]_c. Such a set is called an interval on g.
We recall some well known properties of segments.

If ab || cd, then the segments (ab)_c and (ab)_d are disjoint, and every
point on g different from the endpoints a and b is in one of them. If t is in
(ab)_d, then (ab)_t = (ab)_c. There are thus exactly two segments with given
endpoints a and b on g. A point in one of them shall be called an exterior
point of the other.

If p and q are in (ab)_c, then (ap)_c and (pq)_c are contained in (ab)_c,
and b is an exterior point of (ap)_c and of (pq)_c. The first part of this
proposition remains true if we replace segments by intervals.


LEMMA 3.1. For any point u of a segment (ab)_c, there exist points
p and q in (ab)_c, such that (pq)_c contains u and is contained in (ab)_c.
Then a and b are exterior points of (pq)_c.


Proof. By S.6 there is a point p such that ua || bp. Then cu || ab
implies ab || pc and bp || cu by S.4. Thus p is in (ab)_c, and (bp)_c contains
u and is contained in (ab)_c. Similarly there is a point q in (bp)_c, such that
(pq)_c contains u and is contained in (bp)_c.


LEMMA 3.2. The intersection of two segments (ab)_t and (cd)_t with
a common exterior point t is either empty or a segment (pq)_t, where p and q
are two of the endpoints of the given segments.


Proof. If u is in both segments, then c is in one of the segments (tu)_a
and (tu)_b, and d in the other. If c is in (tu)_b and d in (tu)_a, then c = a or
ct || ua or at || uc by S.2, and similarly d = b or dt || ub or bt || ud. If the
segments are not equal we may assume ct || ua. Then tu || ab implies ab || ct
by S.4, hence c is in (ab)_t. If b = d, then (cd)_t is contained in (ab)_t.
This holds also for dt || ub, since then d is in (ab)_t. If bt || ud, then b is in
(cd)_t; hence (bc)_t is contained in (ab)_t and in (cd)_t, and it is easily shown
to be their intersection.

THEOREM 3.3. If two segments a and b on a line g have a point u in
common, then there exists a segment on g containing u and contained in the
intersection of a and b.


Proof. Let s be an exterior point of a, t an exterior point of b. If s = t,
then a ∩ b is a segment by Lemma 3.2. If s ≠ t, then there exists a segment
c containing u with s and t as exterior points by Lemma 3.1. Then
a ∩ c is a segment with exterior points s and t by Lemma 3.2, and hence
a ∩ b ∩ c is a segment containing u.


Theorem 3.3 shows that segments on a line g define a topology on g in 
which they are a base of open sets. In this topology the closure of the segment
(ab)_c is the interval [ab]_c, so that intervals form a base of closed
neighborhoods. The topology is called the interval topology on g. By Lemma 
3.1, g is a regular space in its interval topology. 

By S. 5 a projectivity between two lines maps segments on segments and 
intervals on intervals, hence it is a homeomorphism between the two lines. 


4. Sectors. By specifying that two pairs of lines in a pencil separate 
each other if and only if they intersect a line not in the pencil in pairs of 
points separating each other, we obtain a relation of separation for pairs of 
lines. Because of S.5, this definition is independent of the choice of the
auxiliary transversal line. It is immediately seen that the dual propositions 
to our axioms S.1 to S.5 hold. Hence the definitions and results of the 
preceding section can be dualized. The dual of a segment will be called an 
angle. 

Let l and m be two lines with a common point r, and let a and b be two
points different from r. Then we shall say that lm || ab whenever lm || pq
for the lines p = ra and q = rb. If l and m are two different lines, and if c is a
point neither on l nor on m, then we denote by (l; m)_c the set of all points x
of the plane such that lm || cx. A set of this type is called a sector. The
lines l and m are called the sides, and the point l ∩ m is called the vertex
of the sector (l; m)_c.

To the sector (l; m)_c corresponds the angle (lm)_rc, where r = l ∩ m,
consisting of all lines of the pencil with vertex r through points of (l; m)_c.
Conversely, the sector (l; m)_c consists of all points on lines of the angle (lm)_rc
and different from its vertex r, or in other words, of all points lying on
exactly one line of the angle (lm)_rc.

Thus the dual of a sector is the set of all lines meeting a given segment 
and different from the line containing the segment, or in other words, the set 
of all lines intersecting a given segment in exactly one point. 

The following properties of sectors are immediately derived from the 
corresponding properties of segments on a line and of angles in a pencil. 

Let l and m be two different lines with point of intersection r, and let c
and d be two points such that lm || cd. Then the sectors (l; m)_c and (l; m)_d
are disjoint, and every point of the plane not on l or on m lies in one of them.




If t is in (l; m)_d, then (l; m)_t = (l; m)_c. There are thus exactly two sectors
with given sides l and m. A point in one of these sectors shall be called an
exterior point of the other.

If g is a line not through r, let p = g ∩ l and q = g ∩ m. Then one of
the segments with endpoints p and q is contained in (l; m)_c, the other in
(l; m)_d. If g is a line through the vertex r, different from l and from m,
then all points of g different from r lie in the same sector with sides l and m.

This implies immediately the following lemma. 


LEMMA 4.1. Let p and q be two different points of the sector (l; m)_c,
and let t = pq ∩ (l ∩ m)c. Then the segment (pq)_t is contained in the
sector (l; m)_c.


5. The order topology of the plane. A set a of points in the plane is
called g-convex, where g is a line, if a and g are disjoint, and if for any two
different points p, q in a, the segment (pq)_t is contained in a for t = pq ∩ g.
A set of points is called convex if it is g-convex for some line g. A convex
set is called open if its intersection with any line is an open set in the interval
topology of the line.

A set consisting of one point not on g is g-convex, and so is π − g, where
π is the plane. Moreover, π − g is open. If a and b are two different points
not on g, and if c = ab ∩ g, then the segment (ab)_c and the interval [ab]_c
are g-convex. A sector (l; m)_c is (rc)-convex and open for r = l ∩ m. By
Lemma 5.1 below, (l; m)_c is also l-convex and m-convex. The intersection
of any number of g-convex sets is again a g-convex set.


LEMMA 5.1. A g-convex set a is h-convex for any line h disjoint with a.


Proof. If p and q are different points of a, let t = pq ∩ g, and let
u = pq ∩ h. Then u is an exterior point of (pq)_t by assumption, hence
(pq)_u = (pq)_t is contained in a.


THEOREM 5.2. Convex open sets define a topology of the plane of points, 
in which they are a base of open sets. 


Proof. Since any point of the plane is contained in some convex open
set, it suffices to prove that for any two convex open sets a and b with a
common point u, there exists a convex open set containing u and contained
in a ∩ b.


Let a be l-convex, and let b be m-convex, where l and m are two lines.
If l = m, then a ∩ b is l-convex and also open, since the intersection of two
open sets on a line is open. If l ≠ m, let c be the sector with sides l and m
containing u. Then c is l-convex and m-convex and open. Hence a ∩ c is
l-convex and open, and also m-convex by Lemma 5.1. Thus a ∩ c ∩ b is
m-convex and open. It contains u and is contained in a ∩ b.

The topology defined in Theorem 5. 2 is called the order topology of the 
plane of points. The order topology of the plane of lines is defined dually. 
From now on, the plane of points and the plane of lines are considered as 
topological spaces with their order topologies. In the plane of points, lines 
are closed sets, and the relative topology on a line is its interval topology. 


6. Convex quadrangles. 


Definition. A convex quadrangle is the intersection of two sectors such 
that the vertex of each is an exterior point of the other. 


If r and s are the vertices of two such sectors, then r ≠ s, and both sectors
are (rs)-convex open sets; hence so is their intersection. A convex quadrangle
is thus an open convex set. 

The intersection of two sectors with different vertices r and s is a convex 
quadrangle if and only if the line rs joining the two vertices is a common 
exterior line of the two angles corresponding to the sectors. 

If a, b, c, d are four points, no three of which are collinear, let

r = ab ∩ cd, s = ad ∩ bc, t = ac ∩ bd

be the diagonal points of the complete quadrangle abcd, and let

ρ_s = (ab; cd)_s, σ_r = (ad; bc)_r, τ_r = (ac; bd)_r,
ρ_t = (ab; cd)_t, σ_t = (ad; bc)_t, τ_s = (ac; bd)_s.

These notations will be used consistently.

If l = ab and m = cd, then³ lm || st. Hence ρ_s and ρ_t are the two
sectors with sides ab and cd, and ρ_s ∩ σ_r is a convex quadrangle containing t.
It is easily seen that any convex quadrangle can be obtained in this fashion.
This shows also that a convex quadrangle is never empty. The points a, b, c, d
are called the vertices, the segments (ab)_r, (bc)_s, (cd)_r, (ad)_s, the sides of
the quadrangle ρ_s ∩ σ_r.


THEOREM 6.1. The three convex quadrangles

ρ_s ∩ σ_r, ρ_t ∩ τ_r, σ_t ∩ τ_s,

their sides

(ab)_r, (cd)_r, (ad)_s, (bc)_s, (ac)_t, (bd)_t,

and their vertices a, b, c, d, form a covering of the plane by disjoint convex sets.


³ Cf. [1], section 3.21, p. 24.




Proof. Let p = ab ∩ st. Then ab || pr, and the segment (ab)_r contains
p and is contained in σ_r and in τ_r; hence it does not intersect one of the
quadrangles. It is equally easily verified that all the sets of the covering
are pairwise disjoint.


An exterior point of (ab)_r on ab lies in (ab)_p, hence in σ_t ∩ τ_s. Thus
a point on a side of the complete quadrangle abcd always is in one of the sets
of the covering.

Now let x be a point not lying on any side of the complete quadrangle
abcd. Then x is in one of the sectors ρ_s and ρ_t, and we may assume x ∈ ρ_s.
If x is in σ_r, x is in the quadrangle ρ_s ∩ σ_r. Otherwise x is in σ_t. Let then
u = sx ∩ ab and v = sx ∩ cd. Then u is in (ab)_p and hence in τ_s, and
similarly v is in τ_s. Since (uv)_s is the intersection of sx with ρ_s, x is in (uv)_s.
But (uv)_s is contained in the (st)-convex set τ_s, and so x is in the quadrangle
σ_t ∩ τ_s. This completes the proof.


LEMMA 6.2. The boundary of a convex quadrangle consists of its sides
and vertices.


Proof. The complement of the open set ρ_t ∪ σ_t consists of the quadrangle
ρ_s ∩ σ_r with its sides and vertices, hence this is a closed set. If u is on a side
of ρ_s ∩ σ_r or one of its vertices, let v = ut ∩ rs. Then the segment (ut)_v is
contained in ρ_s ∩ σ_r, and u is a boundary point of (ut)_v, hence also of ρ_s ∩ σ_r.


LEMMA 6.3. If p is in the segment (ab)_r, and q in (cd)_r, then one of
the two segments with endpoints p and q lies in ρ_s ∩ σ_r, and the other in ρ_t ∩ τ_r.


Proof. Let u = pq ∩ rs and v = pq ∩ rt. Then pq || uv, and (pq)_u is
contained in ρ_s ∩ σ_r, (pq)_v in ρ_t ∩ τ_r.


If the dual of a convex quadrangle is called a convex quadrilateral, then 
the lines intersecting two segments on two different lines form a convex quadri- 
lateral if and only if the point of intersection of the lines is a common exterior 
point for the segments. Thus the lines joining a point of (ab)_r and a point
of (cd)_r form a convex quadrilateral. By Lemma 6.3, the lines of this
quadrilateral do not meet the quadrangle σ_t ∩ τ_s or its boundary. It is easily
seen that every convex quadrilateral can be obtained in this fashion. 


7. Continuity theorems. 


LEMMA 7.1. Let x, y, z be three points not on a line, and let a be an
open convex set containing x. Then there is a complete quadrangle abcd with
diagonal points r = y and s = z such that the convex quadrangle ρ_s ∩ σ_r
contains x and is contained in a together with its boundary.




Proof. If a is l-convex for a line l ≠ yz, let b be the sector with sides l
and yz containing x. Then a ∩ b is (yz)-convex and open. Hence we may
assume a to be (yz)-convex. Then there is a segment (uv)_z on xz containing
x and contained in a together with its endpoints. Since the projection of uy
on vy from z maps segments on segments, there are segments (ab)_y on uy
and (cd)_y on vy, contained in a together with their endpoints, such that
ab || uy and ad ∩ bc = z. Then cd || vy, and abcd is the desired quadrangle.
Here r = y, s = z, and x in (uv)_z lies in ρ_s ∩ σ_r. The vertices of ρ_s ∩ σ_r lie
in the (rs)-convex set a; hence the quadrangle and its sides are contained in a.


Lemma 7. 1 shows that the plane of points is a regular topological space, 
and that convex quadrangles form a base of open sets. 


THEOREM 7.2. The point of intersection of two different lines l and m
is a continuous function of the pair (l, m).


Proof. By Lemma 7.1 we may assume that a neighborhood of the point
l ∩ m is a convex quadrangle ρ_s ∩ σ_r with r on m and s on l. Then the
lines intersecting (ab)_r and (cd)_r form a neighborhood L of l, and the lines
intersecting (ad)_s and (bc)_s form a neighborhood M of m. A line in L does
not meet σ_t ∩ τ_s or its boundary by Lemma 6.3, and a line in M does not
meet ρ_t ∩ τ_r or its boundary. Then it follows from Theorem 6.1 that the
point of intersection of a line in L and a line in M must be in ρ_s ∩ σ_r, and
this proves the theorem.


THEOREM 7.3. The line joining two different points a and b is a
continuous function of the pair (a, b).


This is the dual of Theorem 7. 2. 
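
For orientation only, the duality between Theorems 7.2 and 7.3 can be seen in the classical coordinate plane, where meet and join are given by the same polynomial formula. The sketch below assumes homogeneous rational coordinates and the usual incidence relation ax + by + cz = 0; the function names are hypothetical. It is not the argument of this paper, which uses no coordinates and does not assume the Theorem of Desargues.

```python
from fractions import Fraction

def cross(u, v):
    """Cross product of two homogeneous coordinate triples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def meet(l, m):
    """Point of intersection of two different lines (classical plane only)."""
    return cross(l, m)

def join(a, b):
    """Line joining two different points (classical plane only)."""
    return cross(a, b)

def incident(p, l):
    """Classical incidence: a*x + b*y + c*z == 0."""
    return sum(pi * li for pi, li in zip(p, l)) == 0

def affine(p):
    """Affine coordinates (x/z, y/z) of a point with z != 0."""
    return (Fraction(p[0], p[2]), Fraction(p[1], p[2]))

# The lines x = z and y = z meet in the point (1, 1, 1).
l, m = (1, 0, -1), (0, 1, -1)
p = meet(l, m)

# A slightly tilted copy of l meets m in a nearby point.
l_tilted = (1000, 1, -1000)
p_tilted = affine(meet(l_tilted, m))
```

Since the coordinates of the result are polynomials in the coordinates of the arguments, continuity is immediate in this special case, and exchanging the roles of points and lines turns one statement into the other.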


8. Homeomorphism theorems. We shall use closed neighborhoods in
this section rather than open neighborhoods. The closure of a convex
quadrangle, consisting of the quadrangle with its sides and vertices, shall be called
a projective square. The closure of ρ_s ∩ σ_r will be denoted by T, the closure
of ρ_t ∩ τ_r by S, and the closure of σ_t ∩ τ_s by R. The sides of T are the
intervals [ab]_r, [bc]_s, [cd]_r and [ad]_s.

Any two intervals in the plane are homeomorphic, since projectivities are 
homeomorphisms. If the theorem of Desargues is valid in an ordered plane, 
then all intervals are homeomorphic to the unit interval in the ordered coordi- 
nate field or skew-field. If we speak of a homeomorphism between two 
intervals, we always assume that endpoints are mapped on endpoints. 




THEOREM 8.1. A projective square is homeomorphic to the cartesian
product of two intervals.


Proof. For any point u in T, let x = ab ∩ us and y = ad ∩ ur. Then
x in [ab]_r and y in [ad]_s are continuous functions of u, and u = xs ∩ yr.
Thus u is also a continuous function of the pair (x, y), so that this
correspondence is a homeomorphism between T and the product [ab]_r × [ad]_s.


If abcd and a′b′c′d′ are complete quadrangles in two ordered projective
planes with homeomorphic intervals, then there is a homeomorphism between
the boundaries of the projective squares T and T′, by which the vertices and
sides of T are mapped on the corresponding vertices and sides of T′. Such a
homeomorphism shall be called a p-homeomorphism, where p means “proper.”
A homeomorphism between T and T′ will be called a P-homeomorphism if it
induces a p-homeomorphism between the boundaries.


LEMMA 8.2. There is a P-homeomorphism between T and T′.


Proof. There are homeomorphisms φ between [ab]_r and [a′b′]_r′ and
ψ between [ad]_s and [a′d′]_s′ such that φ(a) = ψ(a) = a′. For any point u
of T, let x′ = φ(ab ∩ us), y′ = ψ(ad ∩ ur), and Φ(u) = x′s′ ∩ y′r′. Then
Φ is a P-homeomorphism.


LEMMA 8.3. For every p-homeomorphism φ between the boundaries of T
and of T′, there is a P-homeomorphism Φ between T and T′ that agrees with
φ on the boundary.


Proof. Since the product of two P-homeomorphisms is a P-homeomorphism,
it suffices by Lemma 8.2 to prove Lemma 8.3 for T = T′. Again,
it is sufficient to consider p-homeomorphisms which leave three sides of T
fixed, since every p-homeomorphism on the boundary of T is a product of
p-homeomorphisms of this special type.


Now let φ be a homeomorphism of [ab]_r onto itself with φ(a) = a and
φ(b) = b. Let A be the closure of the open set τ_r ∩ (ab; tr)_s. Then A is
contained in T, and its intersection with the boundary of T is the interval
[ab]_r. The boundary of A consists of the three intervals [ab]_r, [at]_c, and
[bt]_d. For u in A, u ≠ t, the point x = ab ∩ tu is in [ab]_r. Now define
Φ(u) = φ(x)t ∩ ru for u in A, u ≠ t, and define Φ(u) = u for u in T − A
or u = t. Then Φ(x) = φ(x) for x in [ab]_r, and Φ(u) = u for u in [at]_c
or in [bt]_d. It follows that Φ is continuous everywhere in T, except possibly
at t.


If a″ is a point of [at]_c, then b″ = bt ∩ ra″ is in [bt]_d, and b″,
d″ = dt ∩ sa″, and c″ = b″s ∩ d″r are continuous functions of a″. If a″ = t,
then b″ = c″ = d″ = t. Now let a be a convex neighborhood of t. We can
choose a point a″ in [at]_c, different from t, such that a″, b″, c″, and d″ are
in a. Then the projective square T″ with these vertices is a closed neighborhood
of t contained in a. It follows from the construction that Φ(T″) = T″,
hence Φ is continuous at t.

Since the inverse mapping of Φ can be constructed in the same way as Φ,
Φ is a P-homeomorphism of T onto itself which agrees with φ on [ab]_r and
leaves the other three sides of T fixed. This proves the lemma.


THEOREM 8.4. Two ordered projective planes with homeomorphic inter- 
vals are homeomorphic. 


Proof. Let abcd be a complete quadrangle in one plane, a′b′c′d′ a complete
quadrangle in the other one. Then there is a mapping φ of the sides
and vertices of R, S, and T on the sides and vertices of R′, S′, and T′, which
determines a p-homeomorphism for each of the three pairs of corresponding
projective squares. Then by Lemma 8.3 there are homeomorphisms between
R and R′, between S and S′, and between T and T′ which agree with φ on the
boundaries. Since two squares in the same plane have no common interior
point, these homeomorphisms define a homeomorphism between the two planes.


COROLLARY 1. In an ordered projective plane, the plane of lines is
homeomorphic to the plane of points.


COROLLARY 2. If an interval in an ordered projective plane π is homeomorphic
to the unit interval in an ordered field or skew-field K, then π is
homeomorphic to the coordinate plane over the field K.


It should be remarked that the homeomorphism of Theorem 8.4 need 
not be a collineation. In fact, section 9 gives an example of two ordered 
projective planes which are homeomorphic, but not collinear. 


9. A nondesarguesian ordered plane. Let K be any ordered field or
skew-field. We define points by homogeneous coordinates (x, y, z) over K
with right multiplication, lines by homogeneous coordinates (a, b, c) with
left multiplication.

A point (x, y, z) shall be on a line (a, b, c) with ab ≥ 0 if and only if


ax + by + cz = 0.


For a line (a, b, c) with ab < 0, we distinguish three cases. A point
(x, y, z) shall be on this line if and only if


ax + by + cz = 0 for xz ≤ 0,
½ax + by + cz = 0 for x between 0 and z,
ax + by + (c − ½a)z = 0 for z between 0 and x.


This is a modification of the well known example of a nondesarguesian
affine plane given by Hilbert and originally by F. R. Moulton.⁴ The verification
of the axioms of incidence is quite easy, but rather lengthy. If we define
a separation relation for pairs of points in the obvious manner, then the
axioms S.1 to S.5 are readily verified.

Consider now the triangles abc and a′b′c′ with vertices

a: (0, 1, 2); b: (1, 1, 2); c: (1, 1, 1);
a′: (0, −1, 2); b′: (1, −1, 2); c′: (1, −1, 1).

These triangles are perspective with center (0, 1, 0), but the points


ab ∩ a′b′: (1, 0, 0); bc ∩ b′c′: (0, 0, 1); ac ∩ a′c′: (4, 1, −6)


are not collinear.

By Theorem 8.4, Corollary 2, our plane is homeomorphic to the coordinate
plane over the field K, but the two planes cannot be collinear.
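
The failure of the Theorem of Desargues in this example can be checked mechanically. The sketch below is a verification aid under stated assumptions, not part of the paper: it takes K to be the rationals with integer representatives, reads the incidence rule for ab < 0 as ax + by + cz = 0 for xz ≤ 0, ½ax + by + cz = 0 for x between 0 and z, and ax + by + (c − ½a)z = 0 for z between 0 and x, and takes the vertex c to be (1, 1, 1). The line coordinates in `LINES` were worked out by hand for this sketch and do not appear in the text; both bent equations are doubled to stay in the integers.

```python
def on_line(point, line):
    """Incidence in the modified plane, for integer homogeneous coordinates."""
    x, y, z = point
    a, b, c = line
    if a * b >= 0 or x * z <= 0:
        # Ordinary lines, and the unbent part of a line with ab < 0.
        return a * x + b * y + c * z == 0
    if abs(x) <= abs(z):
        # x between 0 and z: twice (a/2)x + by + cz.
        return a * x + 2 * b * y + 2 * c * z == 0
    # z between 0 and x: twice ax + by + (c - a/2)z.
    return 2 * a * x + 2 * b * y + (2 * c - a) * z == 0

A, B, C = (0, 1, 2), (1, 1, 2), (1, 1, 1)
A2, B2, C2 = (0, -1, 2), (1, -1, 2), (1, -1, 1)
CENTER = (0, 1, 0)

# Hand-computed line coordinates (assumptions of this sketch).
LINES = {
    "ab": ((0, 2, -1), [A, B]),
    "a'b'": ((0, 2, 1), [A2, B2]),
    "bc": ((2, -1, 0), [B, C]),
    "b'c'": ((1, 1, 0), [B2, C2]),
    "ac": ((2, -2, 1), [A, C]),
    "a'c'": ((1, 2, 1), [A2, C2]),
    "aa'": ((1, 0, 0), [A, A2, CENTER]),
    "bb'": ((2, 0, -1), [B, B2, CENTER]),
    "cc'": ((1, 0, -1), [C, C2, CENTER]),
}

P, Q, R = (1, 0, 0), (0, 0, 1), (4, 1, -6)
AXIS = (0, 1, 0)   # the line y = 0 through P and Q

def lines_are_consistent():
    return all(on_line(p, l) for l, pts in LINES.values() for p in pts)
```

The points P, Q, R lie on the respective pairs of corresponding sides, P and Q lie on the line (0, 1, 0), and R does not. No line with ab < 0 can pass through P = (1, 0, 0), since that would force a = 0, so (0, 1, 0) is the only candidate for an axis.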


NORTHWESTERN UNIVERSITY. 


REFERENCES. 


[1] H. S. M. Coxeter, The real projective plane, New York, 1949. 

[2] H. S. M. Coxeter, “Self-dual configurations and regular graphs,” Bulletin of the American
Mathematical Society, vol. 56 (1950), pp. 413-455. 

[3] Marshall Hall, “ Projective planes,” Transactions of the American Mathematical 
Society, vol. 54 (1943), pp. 229-277. 

[4] David Hilbert, Grundlagen der Geometrie, 7. Aufl., Leipzig und Berlin, 1930. 

[5] F. R. Moulton, “A simple non-desarguesian plane geometry,” Transactions of the 
American Mathematical Society, vol. 3 (1902), pp. 192-195. 

[6] Ernst Steinitz, Vorlesungen über die Theorie der Polyeder, Berlin, 1934.

[7] Oswald Veblen and J. W. Young, Projective geometry, Boston, vol. I, 1910, vol. II, 
1916. 


* [4], p. 85, and [5]. 




MEANS IN GROUPS.* 


By W. R. SCOTT.


1. Introduction. A number of authors have given sets of postulates
for the arithmetic mean of n real numbers. Several of these sets of postulates 
have been given in purely algebraic form (see [2], [3], [4], [5], [7], [8]), 
and some of these latter are suitable for generalization to groups. Only the 
set due to Schimmack [7] will be discussed. 

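Let $G$ be a (not necessarily Abelian) group written additively. Let
$f_n(x_1, \dots, x_n)$ be a function of $n$ variables $x_i \in G$ with values
in $G$. Schimmack’s postulates are:


(1) $f_n(h + x_1, \dots, h + x_n) = h + f_n(x_1, \dots, x_n)$ for all $h \in G$.


(2) $f_n(-x_1, \dots, -x_n) = -f_n(x_1, \dots, x_n)$.


(3) $f_n$ is a symmetric function of $x_1, \dots, x_n$.


(4) $f_{n+1}(f_n, \dots, f_n, x_{n+1}) = f_{n+1}(x_1, \dots, x_{n+1})$.

In (4) and throughout the paper $f_n$ will be used for $f_n(x_1, \dots, x_n)$ whenever
no confusion will result. A sequence $\{f_n\}$, $n = 1, 2, \dots$, of functions will
be called a sequence for brevity. A sequence satisfying (1), (2), (3), (4)
will be called a mean on $G$.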

Schimmack [7] showed that if $G$ is the additive group $R$ of reals, then
the only mean is the ordinary arithmetic mean $f_n = (x_1 + \dots + x_n)/n$.
Beetle [1] showed that if $G = R$, then the postulates (1), (2), (3), (4) are
completely independent. It will be shown here that, more generally, if $G$ is
an infinite Abelian torsion-free group such that $mG = G$ for all positive
integers $m$, then Schimmack’s and Beetle’s conclusions hold. The question
of existence and uniqueness of means, together with the complete independence
of (1), (2), (3), (4), will be treated as completely as we are able.


2. Existence of means. If $\{f_n\}$ satisfies (1) and (2), then it follows
readily that for all $h \in G$,


If {fn} satisfies (4), then by induction, for 1S rn, 


* Received September 5, 1951. 




THEOREM 1. A group $G$ possesses a mean if and only if


(i) the equation $mx = g$ possesses a unique solution (denoted by
$x = g/m$) for every $g \in G$ and every positive integer $m$; and


(ii) $y + (-y + nz)/(n+1) = z + (-z + ny)/(n+1)$ for every
$y \in G$, $z \in G$, $n \geq 2$.


If a mean exists, it is unique and is given by


(5) $f_1(x_1) = x_1$, $f_n = f_{n-1} + (-f_{n-1} + x_n)/n$.


Proof. Parts of the argument are the same as in [7] but they will be 
given here for the sake of completeness. 


Suppose first that a mean $\{f_n\}$ exists. Then
$G$ is torsion-free. For if $g \neq 0$ is of finite order $m$,
then by (1) and (3) we have

$f_m(g, 2g, \dots, (m-1)g, 0) = f_m(0, g, \dots, (m-1)g)$
$= g + f_m(g, 2g, \dots, (m-1)g, 0)$,

which is a contradiction.


By (2), $f_n(0, \dots, 0) = -f_n(0, \dots, 0)$, whence $f_n(0, \dots, 0) = 0$,
since $G$ is torsion-free. Hence $f_n(g, \dots, g) = g$ by (1). Thus in particular
$f_1(x_1) = x_1$, as asserted in (5).

We assert that

(6) $n f_n(g, 0, \dots, 0) = g$.

In fact, $f_2(g, 0) = g + f_2(0, -g) = g - f_2(0, g) = g - f_2(g, 0)$, and (6) is
true for $n = 2$. Assume that (6) holds for $n = 2^k$. Let $f_{2^k}(g, 0, \dots, 0) = y$.
Thus $2^k y = g$. Let $z$ be such that $2z = y$ (such a $z$ exists by (6) with $n = 2$).
Thus Then by (4’) 

= +, 0), 0,---, 0) 

= +, 0) 

= 281 (2 4 (Z,° = 
the last equality following from the fact that 




whence *,—2) =0. Thus by induction (6) is true 
for $n = 2^r$, $r = 0, 1, 2, \dots$. Suppose (6) true for $n + 1$. Let
$y = f_n(g, 0, \dots, 0)$. Then

$g = (n+1) f_{n+1}(g, 0, \dots, 0)$
$= (n+1) f_{n+1}(f_n(g, 0, \dots, 0), \dots, f_n(g, 0, \dots, 0), 0)$
$= (n+1) f_{n+1}(y, \dots, y, 0) = (n+1)(y + f_{n+1}(0, \dots, 0, -y))$
$= (n+1)y - y = ny$,

the next to the last equality holding since $y$ and $f_{n+1}(0, \dots, 0, -y)$ commute,
because $-y = (n+1) f_{n+1}(0, \dots, 0, -y)$. Hence (6) is true by induction.

It follows from (6) that the equation $mx = g$ has at least one root.
For $m = 1$ it has exactly one root. Again $f_2(2g, 0) = g + f_2(g, -g) = g$,
and (i) is true for $m = 2$. Assume that (i) is true for $m = 1, \dots, n-1$,
where $n \geq 3$. Then by (1)-(4), (6),
fn(ny, 0,- -,0) 

= fn (fra (ny, 0, fra(ny, *,0),0) 

= fn(ny/(n—1),°: ny/(n—1),0) 

= fin(fnaly, (n— 1)y/(n—2),- (n—1)y/(n—2)), 

fna(y; (n—1)y/(n—2),- (n— 1)y/(n— 2)), 0) 
= (n—1)y/(n—2), 0), 
(n —1)y/(n— 2), 0), y) 

Thus $ny = nz$ implies

$y = f_n(ny, 0, \dots, 0) = f_n(nz, 0, \dots, 0) = z$,

and (i) has been proved by induction.

For $n > 1$,

$f_n = f_n(f_{n-1}, \dots, f_{n-1}, x_n)$
$= f_{n-1} + f_n(0, \dots, 0, -f_{n-1} + x_n) = f_{n-1} + (-f_{n-1} + x_n)/n$,


and (5) has been verified. The uniqueness of the mean follows from (i)
and (5).
By (3), we have


(7) $f_{n+1}(x_1, \dots, x_{n-1}, x_n, x_{n+1}) = f_{n+1}(x_1, \dots, x_{n-1}, x_{n+1}, x_n)$,




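whence by (5),


(8) $f_{n-1} + (-f_{n-1} + x_n)/n + (-(-f_{n-1} + x_n)/n - f_{n-1} + x_{n+1})/(n+1)$
$= f_{n-1} + (-f_{n-1} + x_{n+1})/n + (-(-f_{n-1} + x_{n+1})/n - f_{n-1} + x_n)/(n+1)$.

Let $y$ and $z$ be given. Choose any $x_1, \dots, x_{n-1}$, and then choose $x_n$ and
$x_{n+1}$ so that


(9) $ny = -f_{n-1} + x_n$, $nz = -f_{n-1} + x_{n+1}$.

Making the substitutions (9) in equation (8), we get (ii).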

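Conversely suppose that (i) and (ii) are satisfied by $G$. Define $\{f_n\}$
by (5). Evidently $f_1(x_1) = x_1$ satisfies (1), (2), (3) (and (4) vacuously)
for $n = 1$.

Assume that (1) holds for a certain $n$. Then

$f_{n+1}(h + x_1, \dots, h + x_{n+1})$
$= f_n(h + x_1, \dots, h + x_n) + (-f_n(h + x_1, \dots, h + x_n) + h + x_{n+1})/(n+1)$
$= h + f_n + (-f_n - h + h + x_{n+1})/(n+1) = h + f_{n+1}$,

and (1) holds for all $n$.

Assume that (2) holds for given $n$. Then

$f_{n+1}(-x_1, \dots, -x_{n+1}) = f_n(-x_1, \dots, -x_n) + (-f_n(-x_1, \dots, -x_n) - x_{n+1})/(n+1)$
$= -f_n + (f_n - x_{n+1})/(n+1)$.

Now $(n+1)(-f_n + (f_n - x_{n+1})/(n+1) + f_n) = -x_{n+1} + f_n$. Hence

$-f_n + (f_n - x_{n+1})/(n+1) = (-x_{n+1} + f_n)/(n+1) - f_n$.

Thus

$f_{n+1}(-x_1, \dots, -x_{n+1}) = (-x_{n+1} + f_n)/(n+1) - f_n$
$= -(f_n + (-f_n + x_{n+1})/(n+1)) = -f_{n+1}$.

Hence (2) holds for all $n$.

Re (4), it follows by induction that $f_n(x, \dots, x) = x$. Therefore

$f_{n+1}(f_n, \dots, f_n, x_{n+1})$
$= f_n(f_n, \dots, f_n) + (-f_n(f_n, \dots, f_n) + x_{n+1})/(n+1)$
$= f_n + (-f_n + x_{n+1})/(n+1) = f_{n+1}$.

Hence (4) holds for all $n$.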




As noted above, (3) holds for $n = 1$. Now $(-x_1 + x_2)/2 + (-x_1 + x_2)/2 = -x_1 + x_2$.
Hence $x_1 + (-x_1 + x_2)/2 = x_2 + (-x_2 + x_1)/2$, or $f_2(x_1, x_2)$
$= f_2(x_2, x_1)$, and (3) holds for $n = 2$. Assume that (3) holds for a given
$n \geq 2$. Then to prove (3) inductively it is sufficient by (4) to prove (7),
i.e. (8). But (8) follows from (ii) by means of the substitution (9).
Thus (3) is proved by induction, and (5) defines a mean on $G$.


COROLLARY. If $G$ is an Abelian torsion-free group such that $mG = G$
for all integers $m > 0$, then $G$ has a unique mean given by

$f_n = (x_1 + \dots + x_n)/n$.

Proof. It is easily verified that $G$ satisfies conditions (i) and (ii) of
Theorem 1. Equation (5) yields the above mean.
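
As an illustration of the recursion (5), the following sketch evaluates it in the additive group of rationals, where division by n is unique, and checks on one sample that it agrees with the arithmetic mean and satisfies postulates (1)-(4). The function name `mean` and the sample values are chosen here for illustration; a check on samples is of course not a proof.

```python
from fractions import Fraction
from itertools import permutations

def mean(xs):
    """Recursion (5): f_1 = x_1, f_n = f_{n-1} + (-f_{n-1} + x_n)/n."""
    f = xs[0]
    for n, x in enumerate(xs[1:], start=2):
        f = f + (-f + x) / n
    return f

xs = [Fraction(1), Fraction(5, 2), Fraction(-3), Fraction(7, 3)]
h = Fraction(4, 7)

assert mean(xs) == sum(xs) / len(xs)                              # arithmetic mean
assert mean([h + x for x in xs]) == h + mean(xs)                  # postulate (1)
assert mean([-x for x in xs]) == -mean(xs)                        # postulate (2)
assert all(mean(list(p)) == mean(xs) for p in permutations(xs))   # postulate (3)
assert mean([mean(xs[:3])] * 3 + [xs[3]]) == mean(xs)             # postulate (4)
```

Exact rational arithmetic is used so that the equalities are tested without rounding error.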


There remains the question of the existence of a non-Abelian group 
satisfying conditions (i) and (ii) of Theorem 1. 


3. Complete independence. In order to discuss complete independence
of the postulates (1)-(4) it is necessary to discuss the existence of 16 types
of sequences. For brevity these types will be numbered as follows:


 1. + + + +     5. + - + +     9. - + + +    13. - - + +
 2. + + + -     6. + - + -    10. - + + -    14. - - + -
 3. + + - +     7. + - - +    11. - + - +    15. - - - +
 4. + + - -     8. + - - -    12. - + - -    16. - - - -


where a sequence is of type 7, for example, if (1) and (4) are true while
(2) and (3) are false. The history (i.e. as to existence or non-existence)
of each type of sequence will be discussed as completely as we are able. This
has already been done for type 1. For the group $G$ of order 1, clearly
sequences of types 2 through 16 do not exist. From now on assume that $G$
has at least 2 elements. Let $g \in G$, $g \neq 0$, be a fixed element.


Lemma 1. Every group G possesses sequences of types 3, 4, 9, 10, 11, 12. 
Proof. They will be exhibited.* 
3. Let fx 
4. Let if and let f-— Re (4), 
$f_3(g, 0, 0) = g \neq f_3(f_2(g, 0), f_2(g, 0), 0)$.


* Several of the examples used in the proofs of Lemmas 1 and 2 were obtained by 
L. A. Colquitt and the author while working on the real case. 




9. Let $f_n = 0$.


10. Let $f_2(0, x) = f_2(x, 0) = x$ for all $x \in G$, and let $f_n = 0$ otherwise.
Concerning (4), we have $f_2(g, 0) = g \neq 0 = f_2(f_1(g), 0)$.


11. Let f, and f, —7z, for n>1. 
12. Let f, —0 and f, for n>1. 


LEMMA 2. A group $G$ possesses sequences of types 7, 8, 13, 14, 15, 16
if and only if $G$ has an element $y$ of order greater than 2.


Proof. If $G$ has no element of order greater than 2, then $-x = x$ for
all $x \in G$, and

$f_n(-x_1, \dots, -x_n) = f_n(x_1, \dots, x_n) = -f_n(x_1, \dots, x_n)$.

Therefore (2) is satisfied, and the listed types (and 5 and 6) do not exist.


Conversely suppose that G has an element y of order greater than 2. 


Then the following sequences will show the required existence. 
7. Let $f_n = x_n + y$.
8. Let
13. Let $f_n = y$.


14. Let $f_2(0, x) = f_2(x, 0) = x$ for all $x \in G$, and let $f_n = y$ otherwise.
Re (4), we have $f_2(0, 0) = 0 \neq y = f_2(f_1(0), 0)$.


15. Let fra 
16. Let fn y. 
Lemma 3. A sequence of type 6 exists if and only if G is torsion-free. 


Proof. Suppose that a sequence of type 6 exists. Then the first part of 
the proof of Theorem 1 shows that G must be torsion-free. 


Conversely suppose that G is torsion-free. Let 


if r=1,:--,n, for some permutation of 
(1,---,m). If also Yn), then h + =h’ + 
+ yj, A Yin =’ + Hence yj, = m(—h +h’) +9, 
and $m(-h + h') = 0$. But since $G$ is torsion-free, $-h + h' = 0$, and $h = h'$.




some (unique) $h$. Clearly ~ is an equivalence relation. Choose a repre-
sentative $(x_1^*, \dots, x_n^*)$ in each equivalence class $\{(x_1, \dots, x_n)\}$, choosing
$(g)$, $(g, 0)$ and $(g, -g)$ as the representatives of their respective equivalence
classes, where $g$ is a fixed element $\neq 0$. Note that $\{(g, 0)\} \neq \{(g, -g)\}$.
Let $(x_1, \dots, x_n)$ be given. Then $(x_1, \dots, x_n) = (h + x_{i_1}^*, \dots, h + x_{i_n}^*)$. Define
$f_n(x_1, \dots, x_n) = h$. Now (1) and (3) are obviously satisfied. Re (2),
f:(— 0) =—gA~Ag=——f.(0). Re (4), 


fo(29,0) = 0) = fe(fi(2g), 9). 


The only types not yet discussed are types 2 and 5. The results on 
these are only partial. 


LEMMA 4. If $G$ possesses a sequence of type 2, then $G$ is torsion-free
and possesses a unique solution $x$ of the equation $2x = g$ for all $g \in G$.


Proof. This was shown in the proof of Theorem 1, where (4) was not 
used to prove the existence and uniqueness of such solutions. 


LEMMA 5. Let $G$ be such that
(i) $2x = g$ always has a solution for $x$;


(ii) $nx = g$, $n > 1$, has at most one solution for $x$ for any $g \in G$. Then
a sequence of type 2 exists.


Proof. Because of its length, only an outline of the proof will be given. 


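Let $(x_1, \dots, x_n) R_1 (y_1, \dots, y_n)$ if there exist $h, k \in G$ and a permu-
tation $(i_1, \dots, i_n)$ of $(1, \dots, n)$ such that
$x_r = h + y_{i_r} + k$, $r = 1, \dots, n$.
Let $(x_1, \dots, x_n) R_2 (y_1, \dots, y_n)$ if there exist $h, k \in G$ and a permutation
$(i_1, \dots, i_n)$ of $(1, \dots, n)$ such that
$x_r = h - y_{i_r} + k$, $r = 1, \dots, n$.


Let $(x_1, \dots, x_n) \sim (y_1, \dots, y_n)$ if either $(x_1, \dots, x_n) R_1 (y_1, \dots, y_n)$ or
$(x_1, \dots, x_n) R_2 (y_1, \dots, y_n)$. It follows that ~ is an equivalence relation.
In each equivalence class choose a representative $(x_1^*, \dots, x_n^*)$, letting
$(0, g)$, $(0, g, 3g)$, and $(\tfrac{1}{2}g, \tfrac{1}{2}g, 3g)$ be the representatives of their (distinct)
equivalence classes, where $g$ is a fixed element $\neq 0$.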

Define f, as follows: 




+ 4(—2,* + %,*) 


x,* otherwise; 
+ +, h—a;,* + k) +, +k. 
It can then be shown that $f_n$ is well defined, and satisfies (1), (2) and
(3). Re (4), we have
fs(f2(, 9), f2(0, 9), 39) = 29, 39) = 39 ~0 = 9, 3g). 


LEMMA 6. If $G$ can be ordered so that $x < y$ implies $h + x < h + y$
for all $h \in G$, then a sequence of type 5 exists.


Proof. Define fn(%1,° = Then (1), (3) and 
(4) are satisfied. Now $f_2(g, -g) = f_2(-g, g) = g$ or $-g$. Since $g \neq -g$
(such groups $G$ are torsion-free), (2) is not satisfied.

See [6] for a discussion of such groups with the additional restriction
that $x < y$ imply $x + h < y + h$ for all $h \in G$.

THEOREM 2. If $G$ is an Abelian torsion-free group such that $nG = G$
for all integers $n > 0$, then postulates (1), (2), (3), (4) are completely
independent.


Proof. Such a group $G$ can be ordered (see [6]). Hence by Lemma 6
a sequence of type 5 exists. The other types exist by the Corollary to
Theorem 1 and Lemmas 1, 2, 3 and 5.


Sequences of types 2 and 6 can be replaced by simpler sequences in this 
case by making use of an ordering of G. 


2. Let $f_n(x_1, \dots, x_n) = \tfrac{1}{2}(\min x_i + \max x_i)$.
6. Let $f_n = \min x_i + g$, where $g$ is a fixed element $\neq 0$.
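
The type of a given sequence can be explored numerically. The sketch below works in the rationals with their usual order and tests each postulate on a small grid of sample arguments; the names `pattern`, `midrange`, and `min_plus_g`, the sample grid, the choice g = 1, and the use of the minimum as a candidate of type 5 are all assumptions made for this illustration. A "-" is backed by an explicit counterexample found in the grid, while a "+" only means that no counterexample was found there.

```python
from fractions import Fraction
from itertools import permutations, product

def pattern(f, values, max_n=3):
    """Return a string such as '+-++' recording which of (1)-(4) survive the samples."""
    ok = [True, True, True, True]
    for n in range(1, max_n + 1):
        for xs in product(values, repeat=n):
            fx = f(xs)
            if any(f(tuple(h + x for x in xs)) != h + fx for h in values):
                ok[0] = False                       # postulate (1) fails
            if f(tuple(-x for x in xs)) != -fx:
                ok[1] = False                       # postulate (2) fails
            if any(f(p) != fx for p in permutations(xs)):
                ok[2] = False                       # postulate (3) fails
            if any(f((fx,) * n + (x,)) != f(xs + (x,)) for x in values):
                ok[3] = False                       # postulate (4) fails
    return "".join("+" if flag else "-" for flag in ok)

def arithmetic(xs):
    return sum(xs) / len(xs)

def minimum(xs):
    return min(xs)

def midrange(xs):
    return (min(xs) + max(xs)) / 2

def min_plus_g(xs):
    return min(xs) + 1          # g = 1, a fixed nonzero element

values = [Fraction(v) for v in (-2, 0, 1, 3)]
for f in (arithmetic, minimum, midrange, min_plus_g):
    print(f.__name__, pattern(f, values))
```

On this grid the four functions give the patterns of types 1, 5, 2 and 6 respectively in the table of Section 3.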


THEOREM 3. If $G$ is the additive group of reals, then there exists a
unique mean on $G$, namely $f_n = (x_1 + \dots + x_n)/n$, and (1), (2), (3), (4)
are completely independent.


This follows from Theorem 2 and the Corollary to Theorem 1. A similar 
theorem for the geometric mean of positive real numbers can be given, of course. 


UNIVERSITY OF KANSAS. 




REFERENCES. 


[1] R. D. Beetle, “On the complete independence of Schimmack’s postulates for the
arithmetic mean,” Mathematische Annalen, vol. 76 (1915), pp. 444-446.

[2] ———, Bulletin of the American Mathematical Society, vol. 22 (1916), pp. 276-277.

[3] E. L. Dodd, “The complete independence of certain properties of means,” Annals
of Mathematics, vol. 35 (1934), pp. 740-747.

[4] E. V. Huntington, “Sets of independent postulates for the arithmetic mean, the
geometric mean, the harmonic mean, and the root mean square,” Trans-
actions of the American Mathematical Society, vol. 29 (1927), pp. 1-22.

[5] S. Narumi, “Note on the law of the arithmetical mean,” Tôhoku Mathematical
Journal, vol. 30 (1929), pp. 19-21.

[6] B. H. Neumann, “On ordered groups,” American Journal of Mathematics, vol. 71
(1949), pp. 1-18.

[7] R. Schimmack, “Der Satz vom arithmetischen Mittel in axiomatischer Begründung,”
Mathematische Annalen, vol. 68 (1909), pp. 125-132.

[8] O. Suto, “Law of the arithmetical mean,” Tôhoku Mathematical Journal, vol. 6
(1914), pp. 79-81.




A PROOF OF THE MAXIMAL CHAIN THEOREM.* 


By ORRIN FRINK.


The maximal chain theorem was first proved by Hausdorff in 1914 in [3] 
using transfinite induction. It states that every chain in a partially ordered 
set is contained in a maximal chain. It is equivalent to Zorn’s lemma (cf. 
[1], [2], [5], [7], [8], [9], [10], [11], [13]). It was stated and proved
long before Zorn’s lemma, is somewhat simpler in form, and often just as 
convenient to use. Like Zorn’s lemma, it owes its great usefulness to the fact 
that it allows one to avoid the theory of ordinal numbers and well-ordered sets 
in giving proofs in abstract mathematics. For this reason it would be desirable 
to have a proof of the theorem which is independent of the notion of a
well-ordered set, and dependent only on the axiom of choice. However, all the
proofs in the literature, either of the maximal chain theorem or of Zorn’s 
lemma, seem to involve the notion of a well-ordering (cf. [1], [2], [5], [6], 
[7], [8], [9], [11]). The following proof does not involve the notion of a 
well-ordered set. It was suggested by Zermelo’s second proof of the well- 
ordering theorem in [12]. 


THEOREM. Every chain of a partially ordered set is contained in a 
maximal chain. 


Proof. A chain is a simply ordered set; that is, of any two distinct 
elements of a chain, one necessarily precedes the other. Suppose the theorem 
false. Then in some partially ordered set P there is a chain A not contained 
in any maximal chain. Then corresponding to each chain C which includes 
A as a subset select, by means of the axiom of choice, a larger chain C’, called 
the successor of C, containing only a single element of P not in C. This is 
possible since by assumption no chain which includes A is maximal. 

We shall call a collection K of chains of P all of which include A complete 
if A is in K, the successor of each member of K is in K, and K contains the 
union of each chain of its chains. Clearly the collection of all chains which 
include A is complete, and the intersection of any set of complete collections 
is complete. Let J be the intersection of all complete collections of chains 
of P. Then J is the smallest complete collection. We wish to prove that J 
is a chain, which will involve a contradiction. 


* Received July 11, 1951. 


We shall call a chain C which is a member of the collection J normal
if for every chain X of J either X ⊂ C or C ⊂ X. It will be proved that
every member of J is normal. If C is any normal member of J, define K(C)
to consist of all members X of J such that either X ⊂ C or C’ ⊂ X. Now
the collection K(C) is complete, since in the first place A is in K(C) since
A ⊂ C. Secondly, if X is in K(C) so is its successor X’. For by the definition
of K(C), either C’ ⊂ X or X ⊂ C. If C’ ⊂ X, then C’ ⊂ X’. On the other
hand, if X ⊂ C, then either X’ ⊂ C or C ⊂ X’, since C is normal. If X’ ⊂ C,
then X’ is in K(C). But if C ⊂ X’, then X ⊂ C ⊂ X’, and since X’ has
only a single element of P not in X, it follows that C = X or C = X’, whence
C’ ⊂ X’ or X’ ⊂ C. In either case X’ is in K(C), which therefore contains
the successor of each of its members. Likewise K(C) clearly contains the
union of each chain of its chains, since the defining property X ⊂ C or C’ ⊂ X
of the collection K(C) goes over to unions. It follows that K(C) is complete,
and since it is a subset of J, the smallest complete collection, K(C) must be
identical with J.

But by the defining property of the collection K(C), it follows that the 
successor C’ of every normal member C of J is also normal. Since the union 
of a chain of normal chains is clearly normal, it is seen that the collection 
of all normal members of J is complete, and hence is identical with the collec- 
tion J itself, since J is the smallest complete collection. Hence J is a chain 
of chains, since all of its members are normal. Since J is complete and also 
a chain, it must contain the union U of all of its members, and likewise it 
must contain the successor U’ of U, which is a proper superset of U. But 
this is impossible. This contradiction proves the theorem. 
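
For a finite partially ordered set no appeal to the axiom of choice is needed: the successor step, which enlarges a chain by a single element, can simply be iterated until it is no longer possible. The following sketch illustrates the statement of the theorem in that finite setting only; the function names and the example (the integers 1 to 12 ordered by divisibility) are chosen here and do not come from the paper.

```python
def is_chain(elems, leq):
    """True if every two elements are comparable under leq."""
    return all(leq(a, b) or leq(b, a) for a in elems for b in elems)

def successor(chain, poset, leq):
    """Return a chain with exactly one more element, or None if chain is maximal."""
    for p in poset:
        if p not in chain and is_chain(chain | {p}, leq):
            return chain | {p}
    return None

def maximal_chain(chain, poset, leq):
    """Iterate the successor step until a maximal chain is reached."""
    chain = frozenset(chain)
    while (nxt := successor(chain, poset, leq)) is not None:
        chain = nxt
    return chain

def divides(a, b):
    return b % a == 0

poset = range(1, 13)
result = maximal_chain({2, 12}, poset, divides)
print(sorted(result))   # prints [1, 2, 4, 12]
```

The loop terminates because each step adds an element of a finite set; in the infinite case this naive iteration is exactly what the argument with complete collections replaces.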


Conclusion. Zorn’s lemma, which states that every collection of sets 
which contains the union of each chain of its sets has a maximal element, 
is an immediate consequence of the maximal chain theorem (cf. [1], [8], [9], 
[13]). Another formulation of Zorn’s lemma states that if every chain of a 
partially ordered set P has an upper bound in P, then P contains a maximal 
element (cf. [1], [8], [9]). 

The well-ordering theorem and the axiom of choice may also be derived 
as consequences of the maximal chain theorem by defining properly a partial 
ordering between the well-ordered subsets of a given set, or between the 
selection functions defined on subsets of a given set. 


THE PENNSYLVANIA STATE COLLEGE. 


ORRIN FRINK. 


REFERENCES. 


[1] Garrett Birkhoff, Lattice Theory, revised edition, Colloquium Publications 25,
New York, 1948, pp. 42-44.

[2] N. Bourbaki, Eléments des mathématiques, Théorie des ensembles I, Paris, 1939,
pp. 36-37.

[3] F. Hausdorff, Grundzüge der Mengenlehre, Leipzig, 1914, pp. 140-141.

[4] G. Hessenberg, “Kettentheorie und Wohlordnung,” Journal für die reine und ange-
wandte Mathematik, vol. 135 (1908), pp. 81-133.

[5] H. Kneser, “Eine direkte Ableitung des Zornschen Lemmas aus dem Auswahl-
axiom,” Mathematische Zeitschrift, vol. 53 (1950), pp. 110-113.

[6] C. Kuratowski, “Une méthode d’élimination des nombres transfinis des raisonne-
ments mathématiques,” Fundamenta Mathematicae, vol. 3 (1922), pp. 76-
108.

[7] R. L. Moore, Foundations of Point Set Theory, Colloquium Publications 13, New
York, 1932, pp. 84-85.

[8] T. Szele, “On Zorn’s lemma,” Publicationes Mathematicae Debrecen, vol. 1 (1950),
pp. 254-257.

[9] J. W. Tukey, Convergence and Uniformity in Topology, Princeton, 1940, pp. 7-8.

[10] A. D. Wallace, “A substitute for the axiom of choice,” Bulletin of the American
Mathematical Society, vol. 50 (1944), p. 278.

[11] E. Witt, “On Zorn’s theorem,” Revista Matematica Hispano-Americana, vol. 10
(1950), pp. 82-85.

[12] E. Zermelo, “Neuer Beweis für die Möglichkeit einer Wohlordnung,” Mathe-
matische Annalen, vol. 65 (1908), pp. 107-128.

[13] Max Zorn, “A remark on method in transfinite algebra,” Bulletin of the American
Mathematical Society, vol. 41 (1935), pp. 667-670.


NOTES ON LEFT DIVISION SYSTEMS WITH LEFT UNIT.* 


By M. F. SMILEY.


1. R. Baer introduced in [3] the notion of left division system with
left unit² by showing that these systems arise in a natural way from a simple
method of multiplying the cosets of a subgroup of a group. It is our
contention that many of the properties of loops³ are valid also for these much
more general systems. This is the first of a series of notes in which we shall
support this contention. Herein we are concerned with basic structure
theory culminating in the lemma of Zassenhaus. We owe the brevity⁴ of the
proofs in part to suggestions of a referee.

Let G be a system. We shall use H ⊂_s G as an abbreviation for “H is a
subsystem of G.” We observe that (Z1) [e] ⊂_s G if e is the left unit of G,
and that (Z2) the intersection of a family of subsystems of G is a subsystem
of G. We shall then denote the subsystem of G which is generated by H,
K ⊂_s G by <H, K>. If K is the kernel of a homomorphism η of G onto a
system G’, we shall write K ⊂_n G. If K ⊂_n G, we have (Z3) if H ⊂_s G,
then Hη ⊂_s G’, (Z4) if H’ ⊂_s G’, then H’η⁻¹ ⊂_s G, and (Z5) if H ⊂_n G,
then Hη ⊂_n G’ = Gη. In order to establish (Z5) we first prove that a sub-
system K of G satisfies K ⊂_n G if and only if (Kx)(Ky) = K(xy) = (Kx)y
for every x, y ∈ G. (Cf. Baer [4], p. 455, Lemma 1.) When K ⊂_n G, the
mapping xη → Kx is an isomorphism of G’ and G/K = [Kx; x ∈ G].

Let us now mention a few immediate consequences of our definitions.
If K is the kernel of a homomorphism η of G onto a system G’, and xη = yη
for x, y ∈ G, then x = ky for some k ∈ K. From (Z3) and (Z4) we see that
<H, K>η = <Hη, Kη> for H, K ⊂_s G. We note that K ⊂_n G and K ⊂ H ⊂_s G


* Received July 5, 1951; revised September 24, 1951. 

2 In the remainder of this note we shall use the word system in place of this longer 
phrase. Systems with unit were called left loops by Kiokemeister and Whitehead [13]. 
Their admissible left loops need not be normal in our sense, and there is no overlapping 
of our results for systems with theirs for left-loops. 

3 See [1, 2, 4-7, 10, 11]. 

4 The use of the associative law will not shorten our proofs. It is, of course, well-
known that an associative system is a group.




imply that K ⊂_n H. The modular law (KL ∩ H) = (K ∩ H)L for sub-
systems H, K, L of G with L ⊂ H may be proved as in Baer [4].
The principal result of this note is the following lemma. 


LEMMA. If G is a system, L ⊂_n H ⊂_s G, and K ⊂_n G, then (1)
H ∩ K ⊂_n H, (2) KL ⊂_n KH, (3) (KL ∩ H) ⊂_n H, (4) the mapping
(KL ∩ H)h → (KL)h with h ∈ H is an isomorphism of H/(KL ∩ H) and
KH/KL, (5) if we have H ⊂_n G, then KH = HK ⊂_n G, and (6) if we
have K ∩ H ⊂ L, then Lh → (KL)h with h ∈ H is an isomorphism of H/L
and KH/KL.


Proof. Let η be a homomorphism of G onto a system G’ with kernel K.
By (Z3), Hη is a subsystem of G’. Thus η induces a homomorphism of H
onto the system Hη and the kernel of this induced homomorphism is H ∩ K.
This proves (1). Now set W = <H, K>. Then we have Wη = <Hη, Kη> = Hη,
and W ⊂ KH, W = KH ⊂_s G. If also H ⊂_n G, then KH = HK (cf.
Baer [4], p. 452). By (Z5), Lη ⊂_n Hη = (KH)η. Let φ be a homomorphism
of Hη onto a system H” with kernel Lη. Then ηφ is a homomorphism of KH
onto H” with kernel KL. This proves (2). Again ηφ induces a homo-
morphism of H onto H” whose kernel is KL ∩ H. This proves (3), which is
also an immediate consequence of (1) and (2). If h ∈ H and k ∈ K, we have
(kh)ηφ = hηφ and it follows that (KL)(kh) = (KL)h, since (kh)ηφ → (KL)(kh)
is an isomorphism of H” and KH/KL. We then obtain (4) by noting that
hηφ → (KL ∩ H)h is an isomorphism of H” and H/(KL ∩ H). The state-
ment (5) follows from (2) and our previous observation that H ⊂_n G implies
HK = KH. Finally, (6) follows from (4) and the modular law.


COROLLARY (LEMMA OF ZASSENHAUS). Let G be a system,
A, B, A_1, B_1 ⊂_s G, A_1 ⊂_n A, B_1 ⊂_n B.
Then
A_1(A ∩ B_1) ⊂_n A_1(A ∩ B), B_1(B ∩ A_1) ⊂_n B_1(B ∩ A),


and the identity mapping of G induces an isomorphism of the corresponding
quotient systems.


Proof. Since A ∩ B ⊂_s A and A_1 ⊂_n A, (1) gives A_1 ∩ B ⊂_n A ∩ B.
Likewise, B_1 ∩ A ⊂_n A ∩ B. Using (5), we see that


(A_1 ∩ B)(A ∩ B_1) = (A ∩ B_1)(A_1 ∩ B)


We set K = A_1 and G = A, noting that K ∩ H = A_1 ∩ B ⊂ L, apply (6),
and interchange A and B in the result.




Remarks. 1. If G is associative, then G is a group, and our discussion
includes this case. On the other hand, if G is a loop, then K ⊂_n G does not
imply that K is a normal subloop⁵ in the sense of [1]. Thus our present
exposition does not apply to loops.


2. However, it is possible to formulate a list of axioms which hold for 
systems and for loops and which justify our Lemma. We are indebted to 
R. Baer who suggested that such a list must exist. We consider a set J of 
sets G with binary compositions which have a left unit e, for which xa = b
has a solution x ∈ G for every a, b ∈ G, and such that xa = a implies that x = e.
If H is a subset of G which is an element of J relative to the composition of 
G, we write HC;G. Let 3 be a subset of J such that the requirements 
(Z1)-(Z5) of our second paragraph hold. We agree, of course, to replace 
“system ” by “element of , and “H is a subsystem of G” by “H CG 
and H is an element of 3.” We add the requirement: (Z6) If u is a homo- 
morphism of Ge onto G’e with kernel K, then K (zy) (Ka) (Ky) for 
every t,yeG. The interested reader will be able easily to adapt our proofs 
to these axioms if he adds the hypothesis K C ,»G to the statement of the 
modular law. 


3. It is interesting to observe that a recent theorem of R. C. Buck [12] 
on homomorphisms is valid for multiplicative elements of the set JF of 
Remark 2. But trivial examples show that both our lemma and Buck’s 
theorem fail in general. 


4. Similarity (isotropy) [8, 1, 2, 11] will not reduce the study of systems
to the study of loops. For if G = [1, a, b], with 1 a left unit, a1 = b, b1 = a,
aa = ab = 1, ba = b, bb = a, then G is a system which is not isotropic to a
loop. The referee has remarked that every system is isotropic to a left loop;
but, of course, not every left loop is a loop.


THE STATE UNIVERSITY OF IOWA. 


5 For a simple modification of the example of Bates and Kiokemeister [9] shows 
that a loop may have a system which is not a loop as a homomorphic image. 




REFERENCES. 


[1] A. A. Albert, “Quasigroups I,” Transactions of the American Mathematical Society,
vol. 54 (1943), pp. 507-519.

[2] ———, “Quasigroups II,” ibid., vol. 55 (1944), pp. 401-419.

[3] R. Baer, “Nets and groups I,” ibid., vol. 46 (1939), pp. 110-141.

[4] ———, “The homomorphism theorems for loops,” American Journal of Mathe-
matics, vol. 67 (1945), pp. 450-460.

[5] ———, “Splitting endomorphisms,” Transactions of the American Mathematical
Society, vol. 61 (1947), pp. 508-516.

[6] ———, “Endomorphism rings of operator loops,” ibid., vol. 61 (1947), pp. 517-
529.

[7] ———, “Direct decompositions,” ibid., vol. 62 (1947), pp. 62-97.

[8] ———, “Direct decompositions into infinitely many summands,” ibid., vol. 64
(1948), pp. 519-551.

[9] G. E. Bates and F. Kiokemeister, “A note on homomorphic mappings of quasi-
groups into multiplicative systems,” Bulletin of the American Mathe-
matical Society, vol. 54 (1948), pp. 1180-1185.

[10] B. Brown and N. H. McCoy, “Some theorems on groups with application to ring
theory,” Transactions of the American Mathematical Society, vol. 69
(1950), pp. 302-311.

[11] R. H. Bruck, “Contributions to the theory of loops,” ibid., vol. 60 (1946), pp.
245-354.

[12] R. C. Buck, “A factoring theorem for homomorphisms,” Proceedings of the
American Mathematical Society, vol. 2 (1951), pp. 135-137.

[13] F. Kiokemeister and G. W. Whitehead, “A coset theory for left loops,” (Abstract),
Bulletin of the American Mathematical Society, vol. 51 (1951), pp. 60-61.


A CHARACTERIZATION OF FINITE DIMENSIONAL 
CONVEX SETS.* 


By E. G. STRAUS and F. A. VALENTINE.


Let $S$ be a closed connected set contained in a finite dimensional subspace
of a linear space, and let $R_n$ be the linear subspace of minimal dimension
which contains $S$. A maximal convex subset of $S$ is, by definition, one which
is not contained in a larger convex subset of $S$. It is our purpose to establish
the following result.


THEOREM 1. The set $S$ defined above is convex if and only if each
point $x \in S$ is contained in a unique maximal convex subset $K(x)$ of $S$ of
dimension greater than or equal to $n - 1$. (Note: Observe that no restrictions
are placed on the maximal convex subsets of $S$ of dimension less than $n - 1$).


Definition 1. The property placed on $S$ in Theorem 1 is called property
A. The symbol $|K(x)|$ denotes the maximal $(n-1)$-dimensional volume
obtained from the class of all $(n-1)$-dimensional plane projections of $K(x)$.
($|K(x)|$ may be finite or infinite).


LEMMA 1. If there exists a non-convex closed connected set $S$ with
property A, then there exists a non-convex continuum¹ $S_0$ with property A,
such that

$\mathrm{g.l.b.}_{x \in S_0} |K(x)| = d > 0$.


Proof. Choose a point $x_0 \in S$, and let $C_m$ be the solid sphere in $R_n$ of
radius $m$ with center at $x_0$. Also let $F_m = S \cdot C_m$, and denote the component
of $F_m$ which contains $x_0$ by $T_m$. Since $\sum_{m=1}^{\infty} T_m = S$, and since $S$ is non-convex,
there exists a fixed value $m$ such that $T_m$ is not convex. Consider now the
non-convex set $T_{m+1}$. In a well-established manner (cf. [2], Ch. 7), we may
determine a topological space each element of which corresponds to one and
only one of the maximal convex subsets of $T_{m+1}$ of dimension $\geq n - 1$. By
this process of regarding maximal convex subsets of $T_{m+1}$ as points in a new

* Received August 20, 1951. 
1 A continuum is a compact connected set.




space, it is clear that $T_{m+1}$ is mapped into a set called $T^*_{m+1}$, and it is also
easily verified that $T_m$ is mapped into a non-trivial subcontinuum $T^*_m$ of
$T^*_{m+1}$. (It should be noted that although $T^*_m$ will be closed, bounded and
connected, $T^*_{m+1}$ may not be closed). To each point $x^* \in T^*_m$ we can assign
the positive valued function $f(x^*) = |x^*| = |K(x)|$, where $K(x)$ is the maxi-
mal convex subset of $T_{m+1}$ corresponding to $x^*$. This function $f(x^*)$ is upper
semicontinuous on $T^*_m$. Hence the set $S^*_k = \{x^* \mid f(x^*) \geq 1/k,\ x^* \in T^*_m\}$
is closed. If $S^*_k$ were nowhere dense in $T^*_m$ for each $k$, then $T^*_m = \sum_{k=1}^{\infty} S^*_k$
would be of the first category in itself, which contradicts the fact that it is a
non-trivial continuum. Hence, there exists a value $k$ such that $S^*_k$ contains
a non-trivial subcontinuum $S^*_0$. The pre-image $S_0$ of $S^*_0$ must then be a
non-convex continuum in $T_{m+1}$, since $S_0$ is a closed connected subset of the
compact set $T_{m+1}$. Moreover, $S_0$ satisfies property A, since each point $x \in S_0$
lies in a unique maximal convex subset $K(x)$ of $T_{m+1}$ of dimension $\geq n - 1$
which lies in $S_0$ ($K(x)$ must intersect $T_m$). Also for each $x \in S_0$, we have
$|K(x)| \geq 1/k$, so that $\mathrm{g.l.b.}\ |K(x)| \geq 1/k > 0$.


LEMMA 2. If $S_0$ and $d$ are the quantities defined in Lemma 1, then

$\mathrm{g.l.b.}_{x \in S_0,\ \dim K(x) = n-1} |K(x)| = d$.

Proof. The sets $K(x)$ with dimension $n - 1$ are everywhere dense in $S^*_0$,
since property A implies there exists at most a denumerable number of $K(x)$
with dimension $n$. This combined with the upper semicontinuity of $|K(x)|$
proves the lemma.


LEMMA 3. Each plane $P$ through the centroid of a bounded convex set
$K$ of dimension $m$ and volume $V$ divides $K$ into two convex sets whose volumes
$V_1$ and $V_2$ satisfy the inequalities


$V_i \geq c_m V$ $(i = 1, 2)$,
where $c_m$ is a constant depending solely on $m$ and not on $K$ or $P$.


This lemma was proved for m = 2 by Neumann in [1], and for arbitrary 
m by Green and Gustin [unpublished]. 


LEMMA 4. Consider the set $S_0$ in Lemma 1, and let $K(x)$ be a maximal
convex subset of dimension $n - 1$ such that $|K(x)| < (1 + c_{n-1})d$, where
$c_{n-1}$ is defined in Lemma 3. Let $T_r$ be the right circular solid cylinder of
radius $r$, whose axis passes through the centroid of $K(x)$ and is perpendicular
to the plane of $K(x)$.

Then there exists a neighborhood $U$ of $K(x)$ and a value of $r$ such that
for each point $y \in U \cdot S_0$, the set $K(y)$ intersects all the elements of $T_r$.


Proof. If the lemma were false, there would exist a sequence of convex
sets $K(y_i)$ whose distance from $K(x)$ would approach zero, and such that the
following would hold. The sequence $K(y_i)$ would contain a subsequence which
would converge to a convex subset $C$ of $K(x)$ which does not contain the
centroid of $K(x)$ in its interior. Due to the upper semicontinuity of $|K(x)|$,
we have $|C| \geq d$. Hence, the above with Lemma 3 implies that $|K(x)|$
$\geq |C| + c_{n-1}d \geq (1 + c_{n-1})d$, which contradicts our hypothesis.


Sufficiency proof for Theorem 1. Assume that $S$ is not convex, so that
Lemmas 1 to 4 hold. We use the notation in Lemma 4. According to a
known theorem in topology ([2], p. 16), there exists a non-trivial component
$C$ of $S_0 \cdot U$ containing $K(x)$. Since $S_0$ is non-convex, $C$ is non-convex. Choose
a point $y_1 \in C$. Since $K(y_1) \cdot T_r$ is a convex set dividing $T_r$ into two parts,
let $L_1$ be any line segment parallel to the axis of $T_r$, and having its endpoints
in $K(y_1) \cdot T_r$ and $K(x)$ respectively. Let $L$ be the line containing $L_1$. For
any point $y \in C$, Lemma 4 implies $K(y) \cdot L \neq 0$. Hence, since $C$ is a con-
nected closed bounded set, property A implies that $C \cdot L$ is a closed line
segment. Hence $L_1 \subset C$. But this implies that the portion of $T_r$ between
$K(x)$ and $K(y_1)$ is in $C$. However, this contradicts the fact that $K(x)$ is of
dimension $n - 1$. This completes the sufficiency. The necessity of property
A is obvious.


An interesting corollary to Theorem 1 for sets in $R_2$, the plane, is the
following.


COROLLARY 1. If $S$ is a closed connected set in $R_2$, each point of which
belongs to a unique maximal linear element (line segment, half-line or line)
of $S$, then $S$ is a linear element.


Concluding remarks. One may ask in what respects our theorem is the 
best possible. It is the best possible in at least the following respects. 


If we remove the assumption $\dim K(x) \ge n-1$, then the theorem no
longer holds, as shown by the circular cylinder $x_1^2 + x_2^2 = 1$ in $R_3$.

It might be conjectured that our affine theorem has a purely topological
origin. Thus one might think that if we replace the phrase “unique maximal
convex subset of dimension $\ge n-1$” by “unique maximal closed (in the
point set sense) surface of dimension $n-1$” then the conclusion would be
“$S$ is an $(n-1)$-dimensional closed surface.” However, P. Erdős and A. H.




Stone have communicated to us the following counterexample for the case
$n = 2$. Consider a Cantor set in the interval (0,1) of the X-axis. On each
of the points of this set we erect a line segment of unit length perpendicular
to the X-axis on which $y \ge 0$. Consider the intervals in the complement of
the Cantor set. For each interval $J$, draw the diagonal segment joining the
upper end point of the vertical segment at the left endpoint of $J$ to the right
endpoint of $J$. The resulting set $S$ of vertical segments plus diagonal segments
is connected, and every point in $S$ belongs to a unique maximal closed
arc (either a vertical segment or a polygonal line consisting of two vertical
segments plus a diagonal segment). However, $S$ is not an arc.

The following question is still unsettled. Suppose $S \subset R_n$ is a closed
connected set such that each of its points is contained in a unique maximal
closed connected $(n-1)$-dimensional subset of $S$ which is contained in an
$(n-1)$-dimensional hyperplane of $R_n$. Is then $S$ an $(n-1)$-dimensional
set which is contained in an $(n-1)$-dimensional hyperplane? If $n = 2$, the
question is answered in the affirmative by Corollary 1. For $n > 2$, the question
is still undecided.


UNIVERSITY OF CALIFORNIA AT LOS ANGELES. 


REFERENCES. 


[1] B. H. Neumann, “On an invariant of plane regions and mass distributions,”
Journal of the London Mathematical Society, vol. 20 (1945), pp. 226-237.
[2] G. T. Whyburn, Analytic Topology, American Mathematical Society Colloquium
Publications (1942).




ON ADDITIVE IDEAL THEORY IN GENERAL RINGS.¹


By CHARLES W. CURTIS.

Introduction. It is the purpose of this paper to present some contributions
to the structure theory of non-commutative ideal lattices, as developed
by Krull [10]² and Dilworth [3]. The results of Part 1, except for (5) of
Lemma 1.1 and the first part of Theorem 1.4, hold in an arbitrary
non-commutative residuated lattice in the sense of Dilworth [3]. The results
are stated only for ideals³ in rings, however, since our interest is the
application of these results to the structure theory of rings. The main result
of the paper is the determination in 1.4 of the maximal elements in the
inclusion ordered set of right associated primes of an ideal. In 1.1, 1.2, and
1.3 a decomposition theory of ideals is worked out, similar to the work of
Fuchs [5] for commutative rings. In 1.5 Krull’s theory [10] of right associated
prime ideals is linked to the theory of primary ideals. The notion of
isolated component ideal leads to a new approach to the uniqueness theory
of primary ideals. In 1.6 it is proved that if the ring R satisfies the
ascending chain condition (A.C.C.) for right ideals, and if every ideal in R
is a finite intersection of primary ideals, then the intersection of the powers
of the Jacobson radical is the zero ideal. The methods presented here do not
lead to a proof of this theorem for arbitrary rings with A.C.C. for right
ideals because of an example due to E. Noether, published in [10], of a ring
with A.C.C. for right ideals having the property that not every ideal is an
intersection of primary ideals.
I wish to thank Professor N. Jacobson for the encouragement and many
helpful suggestions he has given me during the preparation of this paper.


Part 1. The General Theory. 


1.0. Notations. In this paper R will denote a non-commutative ring
with a unit element 1. Proper ideals in R will be denoted by A, B, C, ...,
and elements of R by x, y, a, b, .... We shall use the symbol 0 both for


¹ Received June 29, 1951.
² Numbers in brackets refer to the list of references at the end of the paper.
³ “ideal” always means “two-sided ideal.”




the zero ideal and for the zero element of R. $\{\sum C \mid C$ has the property $P\}$
means the join of those ideals $C$ having the property $P$, while $\{x \mid x$ has the
property $P\}$ means the set of all elements $x$ of R having the property $P$.


1.1. Primal ideals. 


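Definition 1.1. The quotient $AB^{-1}$ of the ideals $A$ and $B$ is defined by
$AB^{-1} = \{\sum C \mid CB \subset A\}$. Similarly $B^{-1}A = \{\sum C \mid BC \subset A\}$.


We observe that $AB^{-1} = \{x \mid xB \subset A\}$ and $B^{-1}A = \{x \mid Bx \subset A\}$. The
following properties of the quotients are well known ([10], [3]).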


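LEMMA 1.1.
(1) $(AB^{-1})B \subset A$, $AB^{-1} = R$ if $B \subset A$, and if $B_1 \subset B_2$, then
$AB_1^{-1} \supset AB_2^{-1}$.


(2) $(AC^{-1})B^{-1} = A(BC)^{-1}$.


(4) $A(B_1 + B_2)^{-1} = AB_1^{-1} \cap AB_2^{-1}$, and more generally,
if $\{B_\alpha\}$ is an arbitrary collection of ideals,


(5) $A(\sum_\alpha B_\alpha)^{-1} = \bigcap_\alpha AB_\alpha^{-1}$.
Similar rules hold for the quotients $B^{-1}A$.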
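
The quotient can be computed explicitly in a familiar special case. The sketch below takes R to be the ring of integers, which is commutative and is chosen by us purely for illustration; the ideals (12), (2), (3), (5) are likewise our own choices. It evaluates a few quotients and checks the rule for a sum of two ideals.

```latex
% Assumption: R = \mathbb{Z}, ideals written (a) = a\mathbb{Z}, with a \neq 0.
\[
  (a)(b)^{-1} = \{\, x \in \mathbb{Z} \mid xb \in (a) \,\}
              = \bigl( a / \gcd(a,b) \bigr).
\]
% Instances with A = (12):
\[
  (12)(2)^{-1} = (6), \qquad (12)(3)^{-1} = (4), \qquad (12)(5)^{-1} = (12).
\]
% Check of A(B_1 + B_2)^{-1} = AB_1^{-1} \cap AB_2^{-1} with B_1 = (2), B_2 = (3):
\[
  (2) + (3) = \mathbb{Z}, \qquad
  (12)\,\mathbb{Z}^{-1} = (12) = (6) \cap (4) = (12)(2)^{-1} \cap (12)(3)^{-1}.
\]
```

In this example (5) is prime to (12), while (2) and (3) are not.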


Definition 1.2. If $AB^{-1} = A$ ($B^{-1}A = A$) then $B$ is right (left) prime
to $A$.


Definition 1.3. $A$ is right primal⁴ if the join $P$ of the ideals not right
prime to $A$ is again not prime to $A$. If $A$ is primal, then $P$ is called the
adjoint ideal of $A$.
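
A small worked case may help fix the definition. It is set in the ring of integers, a commutative ring chosen by us only as an illustration, and uses the fact that in that ring $(a)(b)^{-1} = (a/\gcd(a,b))$ for $a \neq 0$. The ideals (4) and (12) are our own examples.

```latex
% Assumption: R = \mathbb{Z}; in this ring (a)(b)^{-1} = (a/\gcd(a,b)) for a \neq 0.
% A = (4): (b) is not prime to A exactly when b is even, so the join is P = (2), and
\[
  (4)(2)^{-1} = (2) \neq (4).
\]
% Hence (4) is primal with adjoint ideal (2).
% A = (12): both (2) and (3) are not prime to A, so the join P contains
\[
  (2) + (3) = \mathbb{Z}, \qquad (12)\,\mathbb{Z}^{-1} = (12).
\]
% Here P = \mathbb{Z} is prime to A, so (12) is not primal.
```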


Let R be a commutative ring. If $G$ is primal in the sense of Definition
1.3 then $G$ is primal in the sense of Fuchs [5]. The converse is false.⁵


⁴ Although there is also a theory of left primal ideals, we shall consider only right
primal ideals. Henceforth, “primal” means “right primal,” and “prime to A” means
“right prime to A.”

⁵ This point is settled by the following example, due to the referee. Let R be the set
of all finite expressions $a_1 t^{n_1} + \cdots + a_r t^{n_r}$, with coefficients in a field, and exponents
non-negative rational numbers. With respect to the obvious definitions of addition and
multiplication, R is a commutative ring in which the principal ideal generated by $t$ is
primal in the sense of Fuchs, but not primal in the sense of Definition 1.3.




Definition 1.4. $P$ is a prime ideal⁶ in an arbitrary ring R, if $AB \subset P$
implies $A \subset P$ or $B \subset P$.


From Lemma 1.1, (2) we have


LEMMA 1.2. If $A$ is primal, then the adjoint ideal $P$ of $A$ is a prime
ideal.


Definition 1.5. A is strongly irreducible if A cannot be expressed as an 
intersection, finite or infinite, of proper divisors of A. A is irreducible if A 
cannot be expressed as an intersection of a finite number of proper divisors 
of A. 


From Lemma 1.1, (5) and (4), we obtain at once 


LEMMA 1.3. Every strongly irreducible ideal is right primal. If R
satisfies the A.C.C. for ideals, then every irreducible ideal is right primal.


THEOREM 1.4. Every ideal in R is the intersection of its primal divisors.
If R satisfies the A.C.C. for ideals, then every ideal in R is an intersection
of a finite number of primal ideals.⁷


Proof. Let $A$ be an ideal in R. Since there are no ideals not prime
to the ring R itself, we shall agree that R is a primal ideal. In order to
prove the first part of the theorem it is sufficient to prove that if $c \notin A$, then
there exists a primal divisor $G$ of $A$ not containing $c$. From Zorn’s lemma,
however, it follows that there exists a divisor $G$ of $A$ having the property that
every proper divisor of $G$ contains $c$. $G$ is therefore strongly irreducible, and
by Lemma 1.3, primal. The second part of the theorem is an immediate
consequence of the A.C.C. and Lemma 1.3.


1.2. Uniqueness of primal decompositions. 


Definition 1.6. (Noether [17]) The intersection $A = G_1 \cap \cdots \cap G_n$
is irredundant if no $G_i$ divides its complement
$G_1 \cap \cdots \cap G_{i-1} \cap G_{i+1} \cap \cdots \cap G_n$.


⁶ This definition is due to Krull [10].

⁷ The first right principal components defined by Krull in [10] are primal ideals,
and therefore a theorem of [10], which states that every ideal is the intersection of its 
first right principal components, is an instance of Theorem 1.4. I am indebted to the 
referee for pointing out to me that since an ideal A is strongly irreducible if and only 
if the ring R/A is subdirectly irreducible in the sense of Birkhoff (“ Subdirect unions 
in universal algebra,” Bulletin of the American Mathematical Society, vol. 50 (1944), 
pp. 764-768), Theorem 1.4 is a consequence of Birkhoff’s result, which states that an 
arbitrary ring is a subdirect sum of subdirectly irreducible rings. 




The intersection is reduced if no G; can be replaced by a proper divisor. Inter- 
sections which are both irredundant and reduced are called normal. 


Throughout the remainder of this section we shall assume that R satisfies 
the A.C.C. for ideals. The idea of the next result is due to Fuchs [5]. 


THEOREM 1.5. Let $A = G_1 \cap \cdots \cap G_n$ be a reduced representation⁸
of $A$ by primal ideals $G_i$ with adjoint prime ideals $P_i$. An ideal $B$ is not
prime to $A$ if and only if $B$ is contained in one of the $P_i$.


Proof. By (4) we have

$$AB^{-1} = G_1B^{-1} \cap \cdots \cap G_nB^{-1}.$$

Since the intersection is reduced, $AB^{-1} \neq A$ if and only if $G_iB^{-1} \neq G_i$ for
some $i$. But $G_iB^{-1} \neq G_i$ if and only if $B \subset P_i$, and the proof is complete.


THEOREM 1.6. The reduced intersection of a finite number of primal
ideals $A = G_1 \cap \cdots \cap G_n$ with adjoint ideals $P_1, \dots, P_n$ is primal if and
only if one prime $P_i$ divides all the others.


Proof. First let some $P_i$, say $P_1$, divide all the other $P_j$. By Theorem 1.5,
if $AB^{-1} \neq A$, then $B \subset P_j \subset P_1$ for some $j$; hence if $S$ denotes the join of
the ideals not prime to $A$, $S \subset P_1$. But again by Theorem 1.5 we conclude
that $AS^{-1} \neq A$, and hence $A$ is primal.


Conversely, let $S$ be the adjoint ideal of $A$. Since $AS^{-1} \neq A$, $S \subset P_i$ for
some $i$. But $\sum_{j=1}^{n} P_j \subset S \subset P_i$, and the theorem is proved.


THEOREM 1.7. Every ideal in R is a normal intersection of primal
ideals, such that no adjoint prime divides another.


Proof. By the A.C.C. $A$ is an intersection of irreducible ideals, and we
can assume that the intersection is irredundant. By Lemma II of E. Noether’s
paper [17] which holds, together with the other results we shall require from
that paper, in the non-commutative case, the intersection is necessarily reduced.
By Lemma 1.3, the ideals appearing in the intersection are primal ideals.
Let their adjoint primes be $P_1, \dots, P_n$, and suppose the indices chosen so
that $P_1, \dots, P_k$ are the maximal elements in the set $\{P_i\}$, ordered by inclusion.
If we replace the intersection $G_1$ of those primal ideals whose adjoint primes
are $P_1$ or a multiple of $P_1$ by $G_1$ itself, then $G_1$ is, by Theorem 1.6, a primal


⁸ By a representation of A, we mean an expression of A as an intersection of some
of its divisors.




ideal, and by Lemma IV of [17] the resulting intersection is still reduced.
Next we replace the intersection $G_2$ of those ideals, not already incorporated
into $G_1$, whose adjoint ideals are $P_2$ or a multiple of $P_2$ by $G_2$, and again by
Lemma IV of [17] the intersection is reduced. By repeating this process $k$
times, we obtain at last a normal representation of $A$ by primal ideals having
the desired properties.


THEOREM 1.8. If $A$ has two normal representations by primal ideals,
such that no adjoint prime in either representation divides another in the
same representation, then the adjoint primes in the two representations and
the number of components are the same.


Proof. Let $A = G_1 \cap \cdots \cap G_n = G_1^* \cap \cdots \cap G_m^*$ be two normal
primal representations of $A$ satisfying the hypotheses of the theorem. Let
the adjoint primes be $P_1, \dots, P_n$ and $P_1^*, \dots, P_m^*$. We shall prove that
$P_1$ is contained in some $P_j^*$. Since both representations are reduced, we can
apply Theorem 1.5 once to conclude that $P_1$ is not prime to $A$, and again
to conclude that $P_1$ is contained in some $P_j^*$. By symmetry we can show that
$P_j^* \subset P_k$ for some $k$. Thus $P_1 \subset P_j^* \subset P_k$, which contradicts our hypothesis
unless $P_1 = P_j^* = P_k$. The rest of the proof is now clear.


1.3. Maximal prime ideals. McCoy defined a set $S$ in a ring R to be
an m-system if $a, b \in S$ imply the existence of an element $x \in R$ such that
$axb \in S$. The empty set is considered to be an m-system. He then defined
the radical $M(A)$ of an arbitrary ideal $A$ in R to be the set of elements $r$
such that every m-system containing $r$ contains an element of $A$. McCoy
proved ([15], Theorem 2) that the radical $M(A)$ of an ideal $A$ is the intersection
of the minimal prime divisors of $A$, thus achieving a successful
generalization of the work of Krull [11]. Levitzki has sharpened this result
by proving in [14] that $M(A)/A$ is actually the lower radical in the sense
of Baer [2] of $R/A$.
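
To make the definition of the radical concrete, the following worked computation takes R to be the ring of integers and A = (12). This commutative special case and the particular m-systems used are our own illustrative assumptions; they are not taken from McCoy's paper.

```latex
% Assumption: R = \mathbb{Z}, A = (12).
% (a) 6 \in M(A): if S is an m-system containing 6, then some x gives
\[
  6 \cdot x \cdot 6 = 36x \in S, \qquad 36x \in (12),
\]
% so every m-system containing 6 meets A.
% (b) 2 \notin M(A): the set
\[
  S = \{\, 2^{k} \mid k \ge 1 \,\}
\]
% is an m-system (take x = 1, since 2^{i} \cdot 1 \cdot 2^{j} = 2^{i+j}), it contains 2,
% and no power of 2 is divisible by 3, so S \cap (12) = \emptyset.
% More generally, if p \in \{2, 3\} does not divide r, the complement of (p) is an
% m-system containing r and disjoint from (12); if 6 divides r, argue as in (a). Hence
\[
  M\bigl((12)\bigr) = (6) = (2) \cap (3),
\]
% the intersection of the minimal prime divisors (2) and (3) of (12).
```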

For commutative rings, Fuchs has characterized in [6] the structure of 
the intersection of all maximal prime ideals belonging to an arbitrary ideal. 
For non-commutative rings R satisfying the A. C. C. for ideals, we shall prove 
a result analogous to Fuchs’ theorem. 


Definition 1.7. Let $P$ be a divisor of $A$, where $A \neq R$. $P$ is a maximal
prime ideal belonging to $A$ if (i) $AP^{-1} \neq A$, and (ii) if $Q$ is a proper divisor
of $P$, then $AQ^{-1} = A$. $P$ is a minimal prime ideal of $A$ if $P$ is a prime ideal,
and if there exists no prime ideal $Q$ such that $A \subset Q \subset P$.




It should be observed that any ideal satisfying (i) and (ii) is necessarily 
a prime ideal, and consequently the definition is meaningful in the form in 
which we have stated it. 

In order to conclude that A has at least one maximal prime ideal, it is 
enough to assume the A.C.C. for ideals. Throughout this section we shall 
assume the A.C.C. for ideals. 


Definition 1.8. The join $S$ of all ideals $C$ such that $A(C + B)^{-1} \neq A$
whenever $AB^{-1} \neq A$ is called the adjoint ideal of $A$ (compare Fuchs [6]).


Since $A(A + B)^{-1} = AA^{-1} \cap AB^{-1} = R \cap AB^{-1} = AB^{-1}$, we see that
$A \subset S$. Until now we have spoken of the adjoint ideal only in connection
with primal ideals. We shall prove that if $A$ is a primal ideal with adjoint
prime $S'$, then $S = S'$. In fact if $c \in S$, then $(c)$⁹ is certainly not prime to $A$;
hence $(c) \subset S'$, and we conclude that $S \subset S'$. Conversely, let $AB^{-1} \neq A$.
Then $A(S' + B)^{-1} = A(S')^{-1} \neq A$; hence $S' \subset S$.


THEOREM 1.9. The adjoint ideal of A is the intersection of all maximal 
prime ideals of A. 


Proof. Let $c$ be contained in the adjoint ideal $S$ of $A$, and suppose that
a maximal prime ideal $P$ of $A$ does not contain $c$. Then $P + (c)$ is a proper
divisor of $P$, and since $AP^{-1} \neq A$, $A(P + (c))^{-1} \neq A$, contrary to our
assumption that $P$ is a maximal prime belonging to $A$. Conversely, let $d$ be
contained in every maximal prime of $A$, and let $B$ be any ideal not prime to $A$.
Then by the A.C.C. $B$ is contained in a maximal prime, say $P^*$, and
$B + (d) \subset P^*$; hence $B + (d)$ is not prime to $A$. Thus $(d)$ and hence $d$
is contained in the adjoint ideal of $A$.


By virtue of a remark of McCoy ([15], page 829) any prime ideal con- 
taining A contains a minimal prime of A. Since the McCoy radical is the 
intersection of all the minimal primes of A, it follows from this remark and 
Theorem 1.9 that the adjoint ideal of A always contains the McCoy radical.

We have defined the set of maximal prime ideals belonging to A
independently of a particular representation of A by primal ideals. The next
theorem shows the connection between maximal prime ideals and primal 
representations. 


THEOREM 1.10. Let $A = G_1 \cap \cdots \cap G_n$ be a normal primal decomposition
of $A$, with adjoint primes $P_1, \dots, P_n$. Then an ideal $P$ is a
maximal prime ideal belonging to $A$ if and only if $P$ is one of the $P_i$.


⁹ $(c)$ is the principal ideal $RcR$ generated by $c$.




Proof. We shall prove first that the $P_i$ are maximal primes belonging
to $A$. By Theorem 1.5, $AP_i^{-1} \neq A$ for each $i$, and again by Theorem 1.5,
since the representation is reduced, each proper divisor of $P_i$ is prime to $A$, $1 \le i \le n$.
Consequently the $P_i$ are maximal primes belonging to $A$. Conversely if $P$ is
any maximal prime belonging to $A$, then $AP^{-1} \neq A$, and by Theorem 1.5,
$P \subset P_i$ for some $i$. From the maximality of $P$ and the fact that $AP_i^{-1} \neq A$,
we conclude that $P = P_i$.


COROLLARY 1.11. $A$ is primal if and only if $A$ has exactly one maximal
prime ideal.
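
The statements of Theorems 1.9 and 1.10 and of Corollary 1.11 can be checked by hand in a small case. The sketch below again takes R to be the ring of integers and A = (12); both choices are our own and serve only as an illustration. It uses the fact that in this ring $(a)(b)^{-1} = (a/\gcd(a,b))$ for $a \neq 0$.

```latex
% Assumption: R = \mathbb{Z}, A = (12) = (4) \cap (3), a normal primal decomposition
% with adjoint primes (2) and (3).
% Maximal primes belonging to A (Definition 1.7):
\[
  (12)(2)^{-1} = (6) \neq (12), \qquad (12)(3)^{-1} = (4) \neq (12),
\]
% and the only proper divisor of (2) or of (3) is \mathbb{Z}, with (12)\mathbb{Z}^{-1} = (12).
% By contrast (6) satisfies (i) but not (ii), because its proper divisor (2)
% is not prime to A:
\[
  (12)(6)^{-1} = (2) \neq (12), \qquad (6) \subsetneq (2), \qquad (12)(2)^{-1} \neq (12).
\]
% Adjoint ideal (Definition 1.8): (c) satisfies the defining condition exactly when
% 2 \mid c (test B = (3)) and 3 \mid c (test B = (2)), so
\[
  S = (6) = (2) \cap (3).
\]
% A has two maximal primes, so it is not primal.
```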


1.4. Associated prime ideals. In the additive ideal theory of commu- 
tative rings it is desirable to give a definition of the prime ideals “ associated ” 
with a given ideal, independently of the notion of primary ideal. Krull gave 
such a definition for commutative rings in [9], and for non-commutative 
rings in [10]. We shall follow his methods in this section. 


Definition 1.9. The ideal $I$ is a (right) isolated component ideal
(I.C.I.) of $A$ if there exists an ideal $B$ and an integer $q > 0$ such that
$I = AB^{-q} = AB^{-(q+1)}$.


We shall assume throughout this section that R satisfies the A.C.C. for
ideals. If $A$ is a given ideal, and $B$ an arbitrary ideal, then we have in general
$AB^{-1} \subset AB^{-2} \subset \cdots$, and by the A.C.C. there exists an integer $q$ such that
$AB^{-q} = AB^{-(q+1)} = \cdots$; the I.C.I. $AB^{-q}$ is called the I.C.I. generated by $B$,
and we shall denote it by $I(A, B)$, or more simply, when it is clear from the
context that $A$ is the basic ideal in the discussion, by $I(B)$.
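
As a hedged illustration of how the chain of quotients stabilizes, take R to be the ring of integers and A = (12). The ring, the ideals, and the reading of $AB^{-q}$ as the quotient of $A$ by the power $B^q$ are our own assumptions for this sketch.

```latex
% Assumption: R = \mathbb{Z}, A = (12), and AB^{-q} is read as A(B^{q})^{-1},
% so that (12)(b)^{-q} = (12 / \gcd(12, b^{q})).
% B = (2):
\[
  (12)(2)^{-1} = (6) \subset (12)(2)^{-2} = (3) = (12)(2)^{-3} = \cdots ,
  \qquad q = 2, \quad I\bigl((2)\bigr) = (3).
\]
% B = (3):
\[
  (12)(3)^{-1} = (4) = (12)(3)^{-2} = \cdots ,
  \qquad q = 1, \quad I\bigl((3)\bigr) = (4).
\]
% B = (5) and B = (6):
\[
  I\bigl((5)\bigr) = (12), \qquad
  (12)(6)^{-1} = (2) \subset (12)(6)^{-2} = \mathbb{Z}, \quad I\bigl((6)\bigr) = \mathbb{Z}.
\]
```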


Definition 1.10. A prime ideal $P$ is a (right) associated prime ideal
of $A$ if (i) $I(P)$ is a proper divisor of $A$, and (ii) $I(P)^{-1}A \subset P$.


LEMMA 1.12. Every maximal prime ideal $P$ of $A$ is a right associated
prime ideal of $A$.


Proof. Since $AP^{-1} \neq A$, $I(P)$, which contains $AP^{-1}$, is a proper divisor
of $A$. It remains to prove that $I(P)^{-1}A \subset P$, that is, if $C$ is an ideal such
that $I(P)C \subset A$ then $C \subset P$. Let $D = AP^{-1}$; then $D$ is a proper divisor
of $A$, and $DP \subset A$. Since $D \subset I(P)$, we have also $DC \subset A$. Combining
these results we obtain $D(P + C) \subset A$, and consequently $A(P + C)^{-1} \neq A$.
Since $P$ is a maximal prime of $A$, $P + C \subset P$, and we have $C \subset P$.


LEMMA 1.13. Every right associated prime of $A$ is contained in a
maximal prime of $A$.




Proof. This result follows immediately from the fact that if $P$ is a right
associated prime of $A$, then $AP^{-1} \neq A$.


From Lemmas 1.12 and 1.13 we obtain at once


THEOREM 1.14. A prime ideal $P$ is a maximal prime ideal of $A$ if and
only if $P$ is a maximal element in the inclusion-ordered set of right associated
prime ideals of $A$.


1.5. Primary ideals. In this section we assume that R satisfies the
A.C.C. for ideals.


Definition 1.11. An ideal $Q$ is (right) primary if for arbitrary ideals
$A$ and $B$, $AB \subset Q$, $A \not\subset Q$ implies $B^r \subset Q$ for some positive integer $r$. As
usual “primary” means “right primary” from now on. If $Q$ is primary
then the ideal

$$P = \{\textstyle\sum B \mid B^r \subset Q \text{ for some positive integer } r\}$$

is called the prime ideal belonging to $Q$, or simply the prime ideal of $Q$.

It is easy to verify that since R satisfies the A. C.C., the ideal P defined 
above actually is a prime ideal. 

If $Q$ is primary, then $Q$ is primal with adjoint prime $P$. The converse
is false, as an example given in [5], page 2 indicates.

Let $A$ be an ideal which is a finite intersection

(6) $A = Q_1 \cap \cdots \cap Q_n$

of primary ideals. Dilworth has observed ([3], Theorem 6.1) that the intersection
(6) can be refined by uniting those ideals $Q_i$ having the same prime,
obtaining an expression for $A$ as an irredundant intersection $A = Q_1^* \cap \cdots \cap Q_m^*$
of primary ideals $Q_i^*$, where $Q_i^* \cap Q_j^*$ is not primary if $i \neq j$. Such an
intersection we shall call a shortest representation of $A$ by primary ideals,
or more briefly, a S.R. of $A$. Dilworth’s result also states that for any S.R.
of $A$, the primes and the number of primary ideals are uniquely determined.
These results have also been announced by Murdoch [16].

Let $A = Q_1 \cap \cdots \cap Q_n$ be a S.R. of $A$ by primary ideals $Q_i$ with
primes $P_i$. Consider a subset $S$ of the set $\{P_i\}$ having the property that if
$P_i \in S$, then $P_j \subset P_i$ implies $P_j \in S$. The intersection of the primary ideals
belonging to the primes in $S$ is called a Noether isolated component of $A$.

The methods of Krull [9] lead directly to a proof of the next result.


THEOREM 1.15. If $B$ is an ideal, then $AB^{-1} = AB^{-2}$ if and only if
$A_1 = AB^{-1}$ is the ring R or a Noether isolated component of $A$ appearing in
every S.R. of $A$. Furthermore, if $A_1$ is any Noether isolated component of $A$,
there exists an ideal $B$ such that $A_1 = AB^{-1}$.


COROLLARY 1.16. $A_1$ is a right isolated component ideal of $A$ if and
only if $A_1$ is a Noether isolated component of $A$.


The following theorem is an application of Theorem 1.15 and the 
methods of Krull [9]. Since the previous theorem can be proved indepen- 
dently of Theorem 6.1 of [3], Part (b) of the next result (which is an 
immediate corollary of Part (a)) gives an alternative approach to Theorem 
6.1 of [3]. 


THEOREM 1.17.


(a) The set of right associated primes of $A$ is identical with the set of
prime ideals belonging to the primary ideals in every S.R. of $A$.

(b) The number of primary ideals in a S.R. of $A$ and the primes
belonging to them are uniquely determined.


(c) The Noether isolated components of $A$ are uniquely determined by
their associated primes.


1.6. On the powers of an ideal. Let R be a ring satisfying the A.C.C.
for right ideals, and having the property that every ideal in R is an intersection
of a finite number of primary ideals. If $A$ is an ideal in R, let
$A^\omega = \bigcap_{i=1}^{\infty} A^i$. The first part of the proof of Satz 1 of [12] can be transferred
to a ring satisfying the above hypotheses, proving

LEMMA 1.18. $A^\omega A = A^\omega$.

THEOREM 1.19. If $J$ is the Jacobson radical of R then $J^\omega = 0$.


Proof. Lemma 1.18 and a result of Jacobson [8, Theorem 10]. 


Part 2. On Some Examples. 


2.1. We shall say that R has a Noetherian ideal theory if i) R satisfies 
the A.C. C., and ii) every ideal in R is an intersection of a finite number of 
(right) primary ideals. The following example, mentioned in the intro- 
duction, shows that not every ring with A. C. C. has a Noetherian ideal theory. 
Let K be the field of rational numbers, and let R be the algebra over K with 




basis elements $e_1, e_2, n$ and the multiplication table $e_i^2 = e_i$, $i = 1, 2$; $n^2 = 0$;
$e_1e_2 = e_2e_1 = 0$; $e_1n = ne_2 = n$; $ne_1 = e_2n = 0$. A straightforward
determination of the ideals in R leads to a verification of the fact that the zero
ideal of R is neither primary nor an intersection of primary ideals.
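
The verification can be sketched in one concrete ring of this kind. The ring used below, the upper triangular 2 by 2 matrices over the rational numbers with matrix units $E_{11}, E_{22}, E_{12}$, is our own choice for illustration; identifying it with the example cited from [10] is an assumption.

```latex
% Assumption: R = upper triangular 2 x 2 matrices over \mathbb{Q},
% with e_1 = E_{11}, e_2 = E_{22}, n = E_{12}, and K = \mathbb{Q}.
% Two-sided ideals A (second column) and B (first row):
\[
  A = Ke_2 + Kn, \qquad B = Ke_1 + Kn, \qquad AB = 0, \qquad A \neq 0.
\]
% B is not nilpotent, since e_1 is idempotent:
\[
  e_1^{\,r} = e_1 \neq 0 \in B^{r} \quad \text{for every } r \ge 1 ,
\]
% so the zero ideal is not (right) primary.
% Every non-zero ideal contains n: for u = x e_1 + y n + z e_2 \neq 0,
\[
  un = xn, \qquad nu = zn ,
\]
% so n lies in the ideal generated by u (if x = z = 0 then u = yn with y \neq 0).
% Hence every non-zero ideal contains Kn, and 0 is not an intersection of
% non-zero ideals, in particular not an intersection of primary ideals.
```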

In view of this example and the fact that the results of 1.5 and 1.6 
are valid for rings with a Noetherian ideal theory, it is important to consider 
examples of rings having a Noetherian ideal theory. It follows easily from 
the fundamental homomorphism theorem that if R has a Noetherian ideal
theory, so has any homomorphic image of R. Another class of examples is
furnished by finite matrix rings over rings with a Noetherian ideal theory. 

From the results of Fitting [4] it follows directly that if R is a ring 
satisfying the A.C.C. for ideals, and having the property that every right 
or left ideal is a two-sided ideal, then R has a Noetherian ideal theory. This 
class of rings contains all Noetherian rings.¹⁰

Let R be a ring with A.C.C. for ideals, satisfying the conditions that
if $P$ is a prime ideal in R, different from the zero ideal, then $P$ is maximal,
and if $P_1$ and $P_2$ are prime ideals in R, then $P_1P_2 = P_2P_1$. We shall apply
the theory of primal ideals to prove that R has a Noetherian ideal theory.
It is sufficient to prove that if $A$ is an ideal in R (possibly the zero ideal),
then $A$ is a finite intersection of primary ideals. Let $A = Q_1 \cap \cdots \cap Q_n$
be a representation of $A$ by primal ideals $Q_i$ with adjoint prime ideals $P_i$.
We shall prove that each $Q_i$ is primary. In fact, consider $Q_1$. Let $P_1'$ be a
minimal prime ideal of $Q_1$. Since the prime ideals commute, it follows from
[10], Theorem 6 that $Q_1P_1'^{-1} \neq Q_1$, and hence $P_1' \subset P_1$. Either $P_1' = 0$ and
hence $A = 0$, which shows that $A$ is prime, or $P_1' = P_1$. In the latter case
$Q_1$ has the unique maximal and minimal prime $P_1$, and it follows that $Q_1$
is primary. Similarly the ideals $Q_2, \dots, Q_n$ are primary. Examples of rings
of this type are non-commutative principal ideal rings, and more generally,
orders satisfying the axioms of Asano.¹¹

We shall digress for a moment to consider other types of ideals related 
to primary ideals. We say that $Q$ is strongly right primary if $ab \in Q$, $a \notin Q$
imply $b^r \in Q$ for some positive integer $r$.

In the commutative case, this definition coincides with Definition III 
given by E. Noether in [17], while our Definition 1.11 coincides with 
Definition IIIa of E. Noether. For commutative rings the two definitions 


¹⁰ A Noetherian ring is a commutative ring with unit element, satisfying the
A.C.C. for ideals.
¹¹ Cf. [7], Chapters III and VI.




are equivalent if R satisfies the A. C.C., but this is no longer true for non- 
commutative rings. 

Let R satisfy the A.C.C. for right ideals. If $Q$ is strongly primary,
then $Q$ is primary, and we shall give an example to show that the converse
is false. Let $Q$ be strongly primary, and suppose $AB \subset Q$, $A \not\subset Q$, where
$A, B$ are ideals in R. By definition $B/Q$ is a nil ideal in the ring $R/Q$,
which also satisfies the A.C.C. for right ideals. By a result of Levitzki [13]
every nil ideal in $R/Q$ is nilpotent, and this proves that $Q$ is primary. For
the example, let $D$ be a finite dimensional central simple algebra, and let
$R = D_n$, the ring of $n$ by $n$ matrices with coefficients in $D$, for $n > 1$. R is a
simple algebra, and its zero ideal is prime, and a fortiori primary. It is easy
to find zero divisors in R.
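
One explicit pair of zero divisors, written for the case n = 2 with the usual matrix units, is sketched below. The choice n = 2 and the particular elements are ours, made for illustration.

```latex
% Assumption: n = 2, E_{ij} the usual matrix units of R = D_2, and Q = 0.
\[
  a = E_{11} = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix}, \qquad
  b = E_{22} = \begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix}, \qquad
  ab = 0 \in Q, \quad a \notin Q,
\]
\[
  b^{r} = E_{22} \neq 0 \quad \text{for every positive integer } r ,
\]
% so Q = 0 is primary (being prime) but not strongly primary.
```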

Murdoch [16] has defined an ideal $Q$ to be primary if $aRb \subset Q$, $a \notin Q$
implies $b \in M(Q)$, where $M(Q)$ is the McCoy radical of $Q$. We shall prove
that for rings satisfying the maximum condition for right ideals, Murdoch’s
definition is equivalent to Definition 1.11. In fact, let $Q$ be primary
according to 1.11, and let $aRb \subset Q$, where $a \notin Q$. It follows that the ideal
$RbR$ is nilpotent modulo $Q$; hence $RbR \subset M(Q)$, and $b \in M(Q)$, since $M(Q)$
is a radical ideal. Conversely let $Q$ be primary in the sense of Murdoch, and
let $AB \subset Q$, $A \not\subset Q$ where $A$ and $B$ are ideals. It follows that $B$ is a nil
ideal modulo $Q$, and by Levitzki’s result again, $B$ is nilpotent modulo $Q$.


2.2. Let $S$ be a simple ring with unit element. Then it is well known
that the center $\Phi$ of $S$ is a field, and $S$ is a central simple algebra over $\Phi$.
Let $S[X_1, \dots, X_n]$, or more briefly $S[X]$, denote the algebra over $\Phi$ of
polynomials in $n$ indeterminates $X_i$, where we assume that the indeterminates
are commutative with one another and with the elements of $S$. With the
same assumptions concerning the $X_i$, let $S\{X_1, \dots, X_n\}$, or more briefly
$S\{X\}$, denote the algebra over $\Phi$ of formal power series in the variables $X_i$.
Let R be either $S[X]$ or $S\{X\}$. In this section we shall prove that if either
$R = S[X]$ and $S$ is arbitrary, or if $R = S\{X\}$ and $(S : \Phi) < \infty$, then R has
a Noetherian ideal theory.

Let $M$ be the monomial basis for R, that is $M$ consists of all monomials
$X_1^{e_1}X_2^{e_2} \cdots X_n^{e_n}$ where the $e_i$ are non-negative integers. Let $f \in R$. Then
$f$ can be written uniquely in the form $f = \sum a(m)m$, $a(m) \in S$, $m \in M$, where
$a(m)$ is always a finitely valued function if $R = S[X]$, and $a(m)$ is an
arbitrary function if $R = S\{X\}$. Let $B = (b, c, \dots)$ be a basis for $S$ over
$\Phi$. Then $a(m)$ can be written uniquely as $a(m) = \sum \lambda(m, b)b$, $\lambda(m, b) \in \Phi$,
where the functions $\lambda(m, b)$ are b-finitely valued, that is the matrix $(\lambda(m, b))$
is row finite. We have




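(1) $f = \sum_m \bigl(\sum_b \lambda(m, b)b\bigr)m = \sum_{m,b} \lambda(m, b)\,bm$


uniquely, and conversely if $(\mu(n, c))$ is any row finite matrix on $M \times B$
to $\Phi$, $g = \sum \mu(n, c)\,cn$ is an element of R. Let $bc = \sum_d \xi(b, c, d)d$, and
$mn = \sum_p \eta(m, n, p)p$ (for each $b, c$, $\xi(b, c, d)$ is d-finitely valued, and for
each $m, n$, $\eta(m, n, p)$ is p-1-valued.) Then


(2) $fg = \bigl(\sum \lambda(m, b)\,bm\bigr)\bigl(\sum \mu(n, c)\,cn\bigr)
= \sum_{m,n,b,c,d,p} \lambda(m, b)\mu(n, c)\xi(b, c, d)\eta(m, n, p)\,dp$,


where the matrix

$(\theta(p, d)) = \bigl(\sum_{m,n,b,c} \lambda(m, b)\mu(n, c)\xi(b, c, d)\eta(m, n, p)\bigr)$

is row finite.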

Now consider the vector space $V$ over $\Phi$ of all mappings on the product
set $B \times M \to \Phi$. If $b \otimes m$ is the mapping which assigns 1 to the pair $(b, m)$
and zero to everything else, then every element $x$ of $V$ can be expressed
uniquely in the form

(3) $x = \sum \lambda(m, b)\, b \otimes m$.

The scalar multiplication, of course, is given by

(4) $\alpha x = \sum \alpha\lambda(m, b)\, b \otimes m$, $\alpha \in \Phi$.


Consider the subspace $W$ of $V$ consisting of all vectors (3) for which $(\lambda(m, b))$
is a row finite matrix. If we define a multiplication in $W$ by means of (2),
then with respect to (2) and (4), $W$ is an algebra over $\Phi$.

Let us assume that some $b$ is the unit element 1 of $S$. Then the
subalgebra of $W$ consisting of all vectors $\sum \lambda(1, m)\, 1 \otimes m$, where $\lambda(1, m)$ is
always m-finitely valued or m-infinitely valued depending upon whether R
is a polynomial or a power series algebra, is isomorphic to $\Phi[X]$ or $\Phi\{X\}$
respectively, and we shall denote it by $1 \otimes \Phi[X]$ (resp. $1 \otimes \Phi\{X\}$).
Similarly the set of vectors $\sum \lambda(b, 1)\, b \otimes 1$, where $1 = X_1^0X_2^0 \cdots X_n^0$, and
where $\lambda(b, 1)$ is always b-finitely valued, forms a subalgebra of $W$ isomorphic
to $S$, which we shall denote by $S \otimes 1$. The algebra $W^*$ generated by finite
sums of products of elements from $1 \otimes \Phi\{X\}$ and $S \otimes 1$ does not equal $W$
unless $S$ is finite dimensional, but in the polynomial case $W^* = W$ without
any restriction on $S$.

In the polynomial case $W$ is simply the Kronecker product of the algebras
$S$ and $\Phi[X]$, but in the power series case, when $(S : \Phi) < \infty$, $W$ is some sort
of a generalized Kronecker product. Let $D = \Phi[X]$ or $\Phi\{X\}$ depending
upon whether $R = S[X]$ or $S\{X\}$. Then we shall write $W = S \otimes D$, and
call it the Kronecker product¹² of $S$ and $D$.
From (1), (2), (3), and (4) we see that the mapping


(5) $\sum \lambda(b, m)\,bm \to \sum \lambda(b, m)\, b \otimes m$

is an isomorphism of R onto $S \otimes D$. We require now the following result.


THEOREM. If $U$ is an ideal in $D$, then $S \otimes U$ is an ideal in $S \otimes D$,
and the mapping $U \to S \otimes U$ is a lattice isomorphism of the lattice of ideals
of $D$ onto the lattice of ideals of $S \otimes D$, provided either $(S : \Phi) < \infty$ or
$D = \Phi[X]$.

In the polynomial case, where $S \otimes D$ is the usual Kronecker product,
the result was proved by Nakayama and Azumaya [1]. When $(S : \Phi) < \infty$,
the following observation and method used by Jacobson¹³ in proving the
theorem of Nakayama and Azumaya, leads to a proof for the power series
case. The essential point in Jacobson’s proof is that if $v \in S \otimes D$, then $v$
can be expressed as a finite sum $v = \sum_{i=1}^{r} a_i \otimes f_i$, $a_i \in S$, $f_i \in D$, where the $a_i$ are
linearly independent in $S$. From our remarks above, this can be done if and
only if $(S : \Phi) < \infty$.
If the conditions of the theorem hold, then it is easy to verify that if A
and B are ideals in D, then

(6)    S ⊗ AB = (S ⊗ A)(S ⊗ B).

From the Theorem and (6) it follows that if Q is a primary ideal in D, then
S ⊗ Q is primary in S ⊗ D. Since D is a Noetherian ring, the theorem
implies that S ⊗ D has a Noetherian ideal theory, and finally, from (5),
since R is isomorphic to S ⊗ D, we conclude that R has a Noetherian ideal
theory.


UNIVERSITY OF WISCONSIN. 


¹² It is not difficult to show that the structure of W is independent of the particular
bases we have chosen for S and D.

¹³ Cf. Theorem 3, Chapter VI, of a book on the structure theory of rings, to appear
in the Annals of Mathematics Studies.


REFERENCES.


[1] G. Azumaya and T. Nakayama, “On irreducible rings,” Annals of Mathematics,
vol. 48 (1947), pp. 949-965.

[2] R. Baer, “Radical ideals,” American Journal of Mathematics, vol. 65 (1943),
pp. 537-568.

[3] R. Dilworth, “Non-commutative residuated lattices,” Transactions of the American
Mathematical Society, vol. 46 (1939), pp. 426-444.

[4] H. Fitting, “Primärkomponentenzerlegung in nichtkommutativen Ringen,”
Mathematische Annalen, vol. 111 (1935), pp. 19-41.

[5] L. Fuchs, “On primal ideals,” Proceedings of the American Mathematical Society,
vol. 1 (1950), pp. 1-8.

[6] L. Fuchs, “On a special property of the principal components of an ideal,” Det
Kongelige Norske Videnskabers Selskab, vol. XXII (1950), pp. 28-30.

[7] N. Jacobson, “The theory of rings,” Mathematical Surveys, Number 2, 1943.

[8] N. Jacobson, “The radical and semi-simplicity for arbitrary rings,” American
Journal of Mathematics, vol. 67 (1945), pp. 300-320.

[9] W. Krull, “Ein neuer Beweis für die Hauptsätze der allgemeinen Idealtheorie,”
Mathematische Annalen, vol. 90 (1923), pp. 55.

[10] W. Krull, “Zweiseitige Ideale in nichtkommutativen Bereichen,” Mathematische
Zeitschrift, vol. 28 (1928), pp. 481-503.

[11] W. Krull, “Idealtheorie in Ringen ohne Endlichkeitsbedingung,” Mathematische
Annalen, vol. 101 (1929), pp. 729-744.

[12] W. Krull, “Dimensiontheorie in den Stellenringen,” Journal für die Reine und
Angewandte Mathematik, vol. 179 (1938), pp. 204-226.

[13] J. Levitzki, “On multiplicative systems,” Compositio Mathematica, vol. 8 (1950),
pp. 76-80.

[14] J. Levitzki, “Prime ideals and the lower radical,” to appear in the American
Journal of Mathematics.

[15] N. H. McCoy, “Prime ideals in general rings,” American Journal of Mathematics,
vol. 71 (1948), pp. 823-833.

[16] D. Murdoch, “Intersections of primary ideals in a non-commutative ring,”
Bulletin of the American Mathematical Society, vol. 56 (1950), Abstract
456.

[17] E. Noether, “Idealtheorie in Ringbereichen,” Mathematische Annalen, vol. 83
(1921), pp. 24-66.


TWO DECOMPOSITION THEOREMS FOR A CLASS OF FINITE 
ORIENTED GRAPHS.* 


By R. Duncan Luce. 


1. Introduction. The object of study in this paper is the class of finite
oriented graphs which are subject to the conditions: 


i. at most two branches exist between any pair of nodes (vertices), and 
ii. whenever two branches do exist between a pair of nodes they shall 
have the opposite orientation. 


Such a system will be called a network. The justification for introducing 
this word is its wide use in those applied sciences where oriented graphs of 
this type are playing an important role; for example: electrical networks, 
sociometric networks or diagrams, abstract programs for digital computers, 
and the neural networks of mathematical biology. 

It is convenient to give a self-contained definition: A network N of order
m is a system composed of two sets M and P, M being a finite non-empty set
of m elements called the nodes of N and P a prescribed subset of the set of
all ordered pairs of nodes. The members of P (i.e. the oriented branches) are
called the links of N. The number of links of a network N will be denoted
by p(N), or simply by p when no ambiguity can arise. To indicate that N
is of order m and has p links we shall write N = N(m, p). Lower case Latin
letters such as a, b, c, ... will be used for nodes, and bracketed ordered pairs
(ab), (ca), ... to denote links. If (ab) is a link, the first node, a, will be
called the initial node and the second, b, the end node of the link.


* Received October 2, 1951. 

¹ Several of the concepts defined in the introduction have been assigned terms by
D. König, Theorie der Endlichen und Unendlichen Graphen, New York, Chelsea
Publishing Co., 1950. A brief glossary with page references to König is presented:

node = Knotenpunkt, p. 1
link = Gerichtete Kante, p. 4
disjoint = Fremd, p. 3
arc of a network = Zweifache Kante, p. 93
link of the form (aa) = Schlinge, p. 3
non-reflexive graph = Graph im engeren Sinn, p. 4
circuit = Zyklus, p. 29
chain which is not a circuit = Bahn, p. 30.




A subnetwork N’ of a network N is a subset M’ of the nodes, M, of N, 
with P’ taken to be some subset (not necessarily proper) of those links of N
which are definable on M’. If M’ = M, we shall say the subnetwork is complete. 
Two subnetworks of a given network are disjoint if they have no nodes, and 
therefore no links, in common. 

Each network is obviously a binary relation over a finite set, its nodes,
and conversely every binary relation over a finite set can be interpreted as a
network. This allows us to present all examples as relation matrices with
entries 0 and 1 from the two element Boolean algebra. Furthermore, this
suggests that if N and N’ are two networks over the same (or isomorphic)
set of nodes M, then by N − N’ we shall mean the complete subnetwork of N
having those links of N which are not links of N’. If N’ is a subnetwork
of N, and if N’ has the set of links P’, then by the network formed from N
by the removal of the links P’, we mean N − N’. If N’ has but one link,
(ab), of N, then we shall write N − N’ = N − (ab).
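The correspondence between networks and relation matrices, and the operation N − N’, can be sketched in a few lines of Python. This is only an illustration under assumptions not made in the paper: nodes are labelled 0, 1, ..., m − 1, links are stored as a set of ordered pairs, and the names `relation_matrix` and `remove_links` are hypothetical.

```python
from typing import List, Set, Tuple

Link = Tuple[int, int]  # an ordered pair (a, b), read "link from a to b"


def relation_matrix(m: int, links: Set[Link]) -> List[List[int]]:
    """Return the m x m 0/1 matrix whose (a, b) entry is 1 iff (ab) is a link."""
    return [[1 if (a, b) in links else 0 for b in range(m)] for a in range(m)]


def remove_links(links: Set[Link], removed: Set[Link]) -> Set[Link]:
    """Links of N - N': the links of N that are not links of N'."""
    return links - removed


# Hypothetical example on nodes 0, 1, 2 with an arc between 0 and 1.
N = {(0, 1), (1, 0), (1, 2), (2, 0)}
matrix = relation_matrix(3, N)
N_minus_link = remove_links(N, {(1, 0)})  # the network N - (10)
```

Storing the links as a set makes N − N’ an ordinary set difference, while the matrix form is the one used for the examples in the text.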

We shall call a network non-reflexive if there are no links of the form (aa). 

In case both the links (ab) and (ba) are present in a network, we shall
say that an arc ab exists between a and b, the arc consisting of this pair of
links, each of which will be said to be a member of the arc. This terminology
is justified by the fact that when every link is a member of an arc the network
is isomorphic (in the obvious sense of the word) to a graph without 2-circuits,
to use a term of Whitney ²; this is what we shall mean by saying that a network
is a graph. Observe that the arcs of a network N are not the same as the
branches (or arcs) of the graph which is oriented to form N. A link of the
form (aa) is always the arc aa.

A (connected and oriented) q-chain from a to b is a set of q links of the
form (ac_1), (c_1c_2), ..., (c_{q−2}c_{q−1}), (c_{q−1}b), such that no node appears more
than once, except in the case a = b where a appears twice. Any q-chain from
a to b will be denoted by (ab, q). Observe that (ab, 1) = (ab). If c is a node
included in a q-chain from a to b, then we may subdivide the chain into the
“product” of two chains, one from a to c, and the other from c to b, i.e.,
(ab, q) = (ac, q’)(cb, q − q’).

An (oriented) circuit is a chain of the form (aa,q). A circuit of two 
links is an arc and conversely. 
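The definitions of chain and circuit can be checked mechanically. The following Python sketch is an illustration only; it assumes a chain is given as an ordered list of links, and the function name `is_chain` is hypothetical.

```python
from typing import List, Tuple

Link = Tuple[int, int]


def is_chain(seq: List[Link], a: int, b: int) -> bool:
    """True iff seq is a chain from a to b: consecutive links, no repeated node
    (except that a occurs twice when a == b, which makes the chain a circuit)."""
    if not seq or seq[0][0] != a or seq[-1][1] != b:
        return False
    # Each link must start where the previous one ended.
    for (_, y), (u, _) in zip(seq, seq[1:]):
        if y != u:
            return False
    visited = [seq[0][0]] + [y for (_, y) in seq]
    if a == b:
        visited = visited[:-1]  # the closing occurrence of a is allowed
    return len(set(visited)) == len(visited)


chain_ok = is_chain([(0, 1), (1, 2)], 0, 2)             # a 2-chain (02, 2)
arc_is_circuit = is_chain([(0, 1), (1, 0)], 0, 0)        # an arc is a 2-circuit
repeat_rejected = is_chain([(0, 1), (1, 0), (0, 2)], 0, 2)
```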

A network is connected if there exists a chain from each node to every
other node. A network which is not connected is disconnected. When N is
treated as an oriented graph, connectedness is defined topologically; our
definition implies topological connectedness but is not implied by it. However,
as applied to networks which are graphs, the two definitions are equivalent.


² Whitney, H., “Non-separable and planar graphs,” Transactions of the American
Mathematical Society, vol. 34 (1932), p. 339.
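The difference between this notion of connectedness and the topological one can be seen in a small Python sketch. The code is an illustration, not part of the paper; it assumes links are ordered pairs of integer node labels, and the names `reachable` and `is_connected` are hypothetical. Reachability along links is used in place of chains, which is harmless because any walk from a to b contains a chain from a to b.

```python
from typing import Iterable, Set, Tuple

Link = Tuple[int, int]


def reachable(links: Iterable[Link], start: int) -> Set[int]:
    """Nodes that can be reached from `start` by following links forward."""
    links = list(links)
    seen, stack = {start}, [start]
    while stack:
        x = stack.pop()
        for (u, v) in links:
            if u == x and v not in seen:
                seen.add(v)
                stack.append(v)
    return seen


def is_connected(nodes: Iterable[int], links: Iterable[Link]) -> bool:
    """Connected in the paper's sense: a chain from each node to every other node."""
    nodes, links = set(nodes), list(links)
    return all(reachable(links, a) >= nodes for a in nodes)


circuit_connected = is_connected(range(3), {(0, 1), (1, 2), (2, 0)})
# A one-way path is topologically connected but not connected in this sense.
path_connected = is_connected(range(3), {(0, 1), (1, 2)})
# When every link is a member of an arc (a "graph"), the two notions agree.
graph_connected = is_connected(range(3), {(0, 1), (1, 0), (1, 2), (2, 1)})
```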


2. A decomposition theorem for arbitrary networks. In this section
we shall give four related definitions for networks of order m which will
depend on the integers k between 0 and m. These definitions will be used in
their full generality in Theorem 2.4, which shows that, in a certain sense,
we need only consider the definitions in the case k = 1. Consequently the
rest of the paper will be, for the most part, devoted to that special case.

First we need a measure of how easily a connected network is disconnected
by the removal of links. We shall say a network is of degree 0 if it is not
connected. A network is of degree k, 1 ≤ k ≤ m, if there exists a set of k
distinct links whose removal from the network will result in a complete sub-
network of degree 0, while the removal of any set of q < k links results in a
complete connected subnetwork.³ The degree of a network is unique.
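For small networks the degree can be computed directly from this definition by trying all sets of links in order of size. The Python sketch below is a brute-force illustration only (exponential in the number of links); the function names are hypothetical and nodes are assumed to be labelled by integers.

```python
from itertools import combinations


def is_connected(nodes, links):
    """A chain must exist from each node to every other node."""
    nodes, links = set(nodes), list(links)
    for start in nodes:
        seen, stack = {start}, [start]
        while stack:
            x = stack.pop()
            for (u, v) in links:
                if u == x and v not in seen:
                    seen.add(v)
                    stack.append(v)
        if not nodes <= seen:
            return False
    return True


def degree(nodes, links):
    """Smallest number of links whose removal disconnects the network (0 if disconnected)."""
    links = list(links)
    if not is_connected(nodes, links):
        return 0
    for k in range(1, len(links) + 1):
        for removed in combinations(links, k):
            rest = [l for l in links if l not in removed]
            if not is_connected(nodes, rest):
                return k
    # Only reached when no removal disconnects, e.g. a single node.
    raise ValueError("no set of links disconnects this network")


circuit3 = {(0, 1), (1, 2), (2, 0)}
simplex3 = {(a, b) for a in range(3) for b in range(3) if a != b}
deg_circuit = degree(range(3), circuit3)
deg_simplex = degree(range(3), simplex3)
deg_path = degree(range(3), {(0, 1), (1, 2)})
```

In the second example p = 6, m = 3 and the degree is 2, so the bound p ≥ km of Lemma 2.1 below is attained.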


LEMMA 2.1. If N(m, p) is a network of degree k, then p ≥ km.


Proof. It will obviously suffice to show that each node is the initial node 
of at least k links. This is true, for if not, then the removal of the links for 
which such a node is the initial node will disconnect N. This contradicts 
the assumption that the degree is k. 


In addition to the concept of degree, we need a condition implying that
there is an even distribution of connectedness throughout the network; roughly,
that the degree of any connected subnetwork is not greater than that of the
network itself. That this is not always the case is evidenced by any graph
formed of an (m − 2)-simplex, m ≥ 4, and a single node joined by a single arc to
one of the nodes of the simplex. The network is of degree 1, and the simplex,
which is a connected subnetwork, is of degree m − 2 ≥ 2. A definition which
will suffice is the following. A network is said to be k-minimal, 1 ≤ k ≤ m,
if the removal of any link results in a complete subnetwork of degree k − 1.
The existence of such networks is proved in Lemma 2.3. A network is
k-uniform if every connected subnetwork is of degree ≤ k. If a network is
1-uniform and connected we say it is uniform.
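The simplex example and the definition of k-minimal can be tried out on the smallest case m = 4. The Python sketch below is a brute-force illustration with hypothetical function names; it repeats a direct implementation of the degree so that it can be run on its own.

```python
from itertools import combinations


def is_connected(nodes, links):
    nodes, links = set(nodes), list(links)
    for start in nodes:
        seen, stack = {start}, [start]
        while stack:
            x = stack.pop()
            for (u, v) in links:
                if u == x and v not in seen:
                    seen.add(v)
                    stack.append(v)
        if not nodes <= seen:
            return False
    return True


def degree(nodes, links):
    links = list(links)
    if not is_connected(nodes, links):
        return 0
    for k in range(1, len(links) + 1):
        for removed in combinations(links, k):
            if not is_connected(nodes, [l for l in links if l not in removed]):
                return k
    raise ValueError("no set of links disconnects this network")


def is_k_minimal(nodes, links, k):
    """Removal of any single link leaves a complete subnetwork of degree k - 1."""
    links = list(links)
    return all(degree(nodes, [l for l in links if l != x]) == k - 1 for x in links)


# 2-simplex on nodes 0, 1, 2 (every pair joined by an arc), plus node 3
# joined to node 0 by a single arc: m = 4.
simplex = {(a, b) for a in range(3) for b in range(3) if a != b}
network = simplex | {(0, 3), (3, 0)}

deg_network = degree(range(4), network)   # the whole network
deg_simplex = degree(range(3), simplex)   # a connected subnetwork of higher degree
circuit_is_minimal = is_k_minimal(range(3), {(0, 1), (1, 2), (2, 0)}, 1)
simplex_is_2_minimal = is_k_minimal(range(3), simplex, 2)
```

So this network is not 1-uniform, while the oriented circuit on three nodes is connected and 1-minimal.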


LEMMA 2.2. If N is a k-minimal network with k ≥ 2, then N is k-uniform
and of degree k.


³ This definition of degree has no relation to the Grad defined by König, op. cit., p. 3.




Proof. Let S be any connected subnetwork of N, and suppose it has
degree d. Let (ab) ∈ S. N − (ab) = N’ is of degree k − 1, so in N’ there
exists a set U of k − 1 ≥ 1 links, whose removal from N’ results in a complete
disconnected subnetwork, N”. If in N” there is a chain from a to b, we may
replace (ab) and still have a disconnected network N* which is formed from
N by removing the links of U. In that case, the removal of any (cd) ∈ U
from N results in a complete subnetwork of degree k − 2 ≥ 0, since k ≥ 2.
This is contrary to the assumption that N is k-minimal, so there is no chain
from a to b. Thus, the removal of no more than k links, those of U which
are in S and (ab), from S, implies a is not connected to b by any chain. It
follows that d ≤ k.


Specifically, N has degree d ≤ k. If d ≤ k − 1, then, since the removal
of any link results in a complete subnetwork of degree k − 1, it follows that
d = k − 1. Let U be a set of k − 1 links whose removal from N results in a
complete disconnected subnetwork. U is non-empty since k ≥ 2. Remove
(ab) ∈ U from N. The resulting network is, by definition, of degree k − 1;
however, the remaining k − 2 links of U disconnect N − (ab). Hence d = k.

We note that the above argument does not apply for k = 1; in fact, any
disconnected network is 1-minimal, since the removal of any link results in a 
complete subnetwork of degree 0. But some networks are both connected and 
1-minimal; these we shall call minimal. A minimal network is clearly non- 
reflexive and uniform. 


LEMMA 2.3. If N is a network of degree k ≥ 1, then for every integer q,
1 ≤ q ≤ k, there exists a complete connected subnetwork of N which is
q-minimal.


Proof. Let C_q be the set of all complete connected subnetworks of N
having degree q. Since N is finite and q ≤ k, it is obvious that C_q is non-
empty. Let

$$p_q = \max_{N' \in C_q} p(N - N').$$

Since N is finite, there exists some N_q ∈ C_q such that p(N − N_q) = p_q.
N_q is, by choice, connected. Hence it will suffice to show that N_q is q-minimal.
Suppose the removal of some link does not result in a complete subnetwork
of degree q − 1. Then, since the removal of one link cannot lower the degree
by more than 1, the resulting network N’ has degree q. Thus N’ ∈ C_q and
p(N − N_q) < p(N − N’) ≤ p_q which is contrary to choice.
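For q = 1 the extremal choice in this proof amounts to keeping as few links as possible while staying connected: a connected complete subnetwork with the fewest links loses connectedness when any link is removed, so it has degree 1 and maximizes p(N − N’). The Python sketch below searches for such a subnetwork by brute force; it is an illustration only, with hypothetical function names, and is practical only for very small networks.

```python
from itertools import combinations


def is_connected(nodes, links):
    nodes, links = set(nodes), list(links)
    for start in nodes:
        seen, stack = {start}, [start]
        while stack:
            x = stack.pop()
            for (u, v) in links:
                if u == x and v not in seen:
                    seen.add(v)
                    stack.append(v)
        if not nodes <= seen:
            return False
    return True


def fewest_link_connected_subnetwork(nodes, links):
    """Return a connected complete subnetwork with as few links as possible,
    or None if the network itself is not connected."""
    links = list(links)
    for size in range(len(links) + 1):
        for kept in combinations(links, size):
            if is_connected(nodes, kept):
                return set(kept)
    return None


simplex = {(a, b) for a in range(3) for b in range(3) if a != b}
sub = fewest_link_connected_subnetwork(range(3), simplex)
# Removing any one of its links disconnects it, i.e. it is connected and 1-minimal.
every_link_needed = all(not is_connected(range(3), sub - {l}) for l in sub)
```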

A complete connected subnetwork N’ of N such that, in the terms of the
above proof, p(N − N’) = p_q, is called a q-descendant of N. If N_q and N’_q
are two q-descendants of a network N, then p(N_q) = p(N’_q). It is clear that
every connected network has at least one 1-descendant, but this is not generally
true for q > 1. Because of their importance the 1-descendants will be called
simply descendants. It is clear that a descendant is minimal.

A network N will be called the sum of complete subnetworks N_i,
i = 1, 2, ..., t, and written $N = \sum_{i=1}^{t} N_i$, if each link of N is contained in
exactly one of the N_i.

THEOREM 2.4 (first decomposition theorem). To every network N there
exists a unique number k, its degree, and at least one set of k + 1 complete
1-minimal subnetworks, N_i, such that

i. $N = \sum_{i=1}^{k+1} N_i$,

ii. N_{k+1} is disconnected,

iii. if k ≥ 1, then N_1 is minimal,

iv. $\sum_{i=1}^{j} N_i$ is a j-descendant of $\sum_{i=1}^{j+1} N_i$, 1 ≤ j ≤ k,

v. the connected subnetworks of the N_i, 1 ≤ i ≤ k, are minimal, and
so these networks N_i are 1-uniform.


Proof. By definition there is a unique degree k assigned to every network.
If k = 0, then N is not connected and we are done. If k > 0, select, according
to Lemma 2.3, a k-descendant N’_k of N, and define N_{k+1} = N − N’_k. N_{k+1}
is not connected; for if so, N is the sum of two complete subnetworks having,
respectively, degree k (Lemma 2.2) and degree ≥ 1. This, we will show,
implies that N has degree ≥ k + 1, which is contrary to assumption.


To show this we prove the slightly more general statement: If
N = N_1 + N_2, and these networks have degrees k, k_1, and k_2 respectively,
then k ≥ k_1 + k_2. For, by definition, there exists a set U having k links,
such that their removal from N results in a complete disconnected subnetwork
N’, and this is not true for any smaller set. Of these k links, let u_1 be in N_1,
and u_2 in N_2. By the definition of a sum, k = u_1 + u_2. Furthermore, u_1 ≥ k_1,
since we may remove from N first the links of U and then the remaining links
of N_2. This complete subnetwork, which obviously is N_1 minus u_1 links,
is disconnected, so u_1 ≥ k_1. Similarly, u_2 ≥ k_2, whence the result.

In the network N’_k, select a (k − 1)-descendant, N’_{k−1}, and let N_k = N’_k
− N’_{k−1}. N_k is 1-minimal, for if (ab) ∈ N_k, let N*_k = N’_k − (ab). Then,
by the definition of k-minimal, N*_k is of degree k − 1. But since N*_k con-
tains N’_{k−1}, the latter is a (k − 1)-descendant of the former. Then the
argument given above shows that N*_k − N’_{k−1} = N_k − (ab) is not connected.

The argument proceeds inductively without difficulty, since the last argu-
ment is independent of k. When we get to the case N’_2, N’_2 − N_2 is a
descendant of N’_2 and thus is minimal rather than simply 1-minimal.

Condition (iv) is satisfied by our choice of the N_i.

Finally, the connected subnetworks of N_j, 2 ≤ j ≤ k, are minimal. For
suppose S is a connected subnetwork of N_j such that S − (ab) is a connected
subnetwork of S. Let N*_j = N’_j − (ab). Since N’_j is j-minimal, N*_j is of
degree j − 1, so there exists a set of j − 1 links whose removal from N*_j will
result in a complete disconnected subnetwork. At least one of these links is
in S, since there exists, in S, a chain (ab, q) ≠ (ab). Thus there are at most
j − 2 ≥ 0 of these links not in N_j, so that the removal of at most j − 2 links
from the descendant N’_{j−1} results in a complete disconnected subnetwork. This
is in contradiction to Lemma 2.3 which shows that N’_{j−1} is (j − 1)-minimal.
Thus S is minimal. If k ≥ 1, N_1 is minimal, and therefore the connected
subnetworks are minimal. It follows immediately that these N_j are 1-uniform.

In the sense of this theorem, the study of an arbitrary network has been 
reduced to the study of a collection of 1-minimal networks. These 1-minimal 
networks are either connected, and so minimal, or disconnected. But a dis- 
connected network consists of isolated nodes, isolated chains, and connected 
pieces. For k of the subnetworks, part (v) shows that the connected pieces
are minimal. If the theorem is applied repeatedly to the connected pieces of
N_{k+1}, it may, in the same sense, be reduced to isolated nodes, isolated chains,
and minimal subnetworks. Thus we may say that, in a sense, the study of 
any network may be reduced to the study of minimal networks. This 
exaggerates the present state of the art, since we do not know whether this 
decomposition is sufficiently strong to allow general conclusions about net- 
works, or even k-minimal networks, from a knowledge of minimal networks. 
In fact, an important unsolved problem is the relationship between two 
distinct decompositions of this type for a given network. That two distinct 
decompositions may exist is shown by: 


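$$
\begin{pmatrix} 0&0&1&0&1\\ 0&1&0&1&0\\ 0&1&1&0&0\\ 1&0&0&1&0 \end{pmatrix}
=
\begin{pmatrix} 0&0&0&0&0\\ 0&1&0&0&0\\ 0&0&1&0&0\\ 0&0&0&1&0 \end{pmatrix}
+
\begin{pmatrix} 0&0&1&0&1\\ 0&0&0&1&0\\ 0&1&0&0&0\\ 1&0&0&0&0 \end{pmatrix}
=
\begin{pmatrix} 0&0&1&0&0\\ 0&0&0&1&0\\ 0&1&0&0&0\\ 0&0&0&0&0 \end{pmatrix}
+
\begin{pmatrix} 0&0&0&0&1\\ 0&1&0&0&0\\ 0&0&1&0&0\\ 1&0&0&1&0 \end{pmatrix}
$$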




On the basis of the preceding remarks we are led to devote the rest of 
this paper to beginning a study of minimal networks. Section 3 includes a 
decomposition of any minimal network and the deduction of several properties 
of minimal networks. These properties are used in section 4 to draw some 
conclusions about arbitrary connected networks. In section 5 we discuss the 
relationships between several of our concepts and that of a tree in graph 
theory. Finally, in the last section, we present an interesting inequality, and, 
from this, define a subclass of minimal networks, the members of which are 
shown to have a particularly simple form. 


3. A decomposition theorem for minimal networks. This section pre- 
sents a decomposition theorem for any minimal network, which may be used 
to show that there exists a close connection between the concept of a minimal 
network and the concept of a tree in graph theory. We note first that a net- 
work which is a tree is minimal. However, the class of minimal networks is 
much wider than that, for we know that every connected network has a 
descendant, which is minimal, and we have 


LEMMA 3.1. If N is a connected network and T a descendant of N,
then T is a tree only if T = N.


Proof. If T is a tree and T ≠ N, then T must have been formed from
N by the removal of at least one link. Reintroduce one of these, say (ab),
into T. This must introduce an oriented circuit on at least three nodes, since
T is connected; let it be (ab)(bc_1) ··· (c_q a). Since this circuit is on three
or more nodes, and the links of T are members of arcs, it follows that
(ab, q + 1) = (ac_q)(c_q c_{q−1}) ··· (c_1 b), q + 1 ≥ 2, exists. Because of the existence
of the circuit, this chain may be removed to result in the complete connected
subnetwork N’. Now since q + 1 ≥ 2, p(N − N’) > p(N − T), so T is not a
descendant of N, which is contrary to assumption.


To carry further the work of this section we need two more definitions. 
First, a network is a compound circuit of order 1 if it is simply a non-reflexive 
oriented circuit on its nodes; assuming a compound circuit of order s—1 
defined, a compound circuit of order s is formed from one of order s —1 by 
replacing some node c of that network by a non-reflexive circuit C, each link 
of the form (ac) by one and only one link of the form (ac’) where c’ ∈ C,
and each link of the form (ca) by one and only one link of the form (c’a),
c’ ∈ C. We shall refer to this as an inductive composition of a compound
circuit. Obviously any compound circuit is connected; furthermore, we have 




LEMMA 3.2. If N(m, p) is a compound circuit of order s, then
s = p − m + 1; i.e., if N is formed by orienting the graph G, s is the first
Betti number of G.⁴


Proof. Let N be formed by the inductive composition of the circuits C_i,
i = 1, 2, ..., s, in their natural order, and suppose each C_i has m_i nodes
and hence the same number of links, p_i = m_i. It follows by a simple induction that

$$m = m_1 + (m_2 - 1) + \cdots + (m_s - 1) = \sum m_i - (s - 1)$$

and $p = \sum p_i = \sum m_i$, so that s = p − m + 1. It is well known that p − m + 1
is the first Betti number of any connected graph having p branches and m
nodes.
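The counting argument can be followed on a small inductive composition. The Python sketch below is an illustration only: the helper names are hypothetical, and the rule that all incoming links are attached to the first node of the new circuit and all outgoing links to its last node is an arbitrary choice among those the definition allows.

```python
def circuit(nodes):
    """Links of the oriented circuit through `nodes` in the given order."""
    n = len(nodes)
    return {(nodes[i], nodes[(i + 1) % n]) for i in range(n)}


def replace_node(links, c, new_nodes):
    """Replace node c by a non-reflexive circuit on new_nodes (fresh labels,
    except that c itself may be reused). Each link (ac) becomes one link into
    the circuit and each link (ca) becomes one link out of it."""
    assert len(new_nodes) >= 2, "a non-reflexive circuit needs at least two nodes"
    entry, exit_ = new_nodes[0], new_nodes[-1]
    moved = {(exit_ if u == c else u, entry if v == c else v) for (u, v) in links}
    return moved | circuit(new_nodes)


def p_minus_m_plus_1(links):
    nodes = {x for link in links for x in link}
    return len(links) - len(nodes) + 1


order1 = circuit([0, 1, 2])                  # compound circuit of order 1
order2 = replace_node(order1, 2, [2, 3, 4])  # order 2
order3 = replace_node(order2, 0, [0, 5])     # order 3 (the new circuit is an arc)
orders = [p_minus_m_plus_1(n) for n in (order1, order2, order3)]
```

Each replacement adds m_i − 1 nodes and m_i links, so p − m + 1 grows by exactly one per step.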
A connected network N will be said to be reducible into subnetworks N_1
and N_2 if:

i. N_1 and N_2 are disjoint,

ii. N_1 and N_2 are each either connected or consist of a single node,

iii. there exists a network N’, formed of N_1 and N_2 joined by exactly
one link from N_1 to N_2 and exactly one from N_2 to N_1, such that
N’ = N.


If a connected network is not reducible it is called irreducible. 
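Conditions i to iii can be tested by brute force over all splittings of the node set into two parts. The Python sketch below is an illustration with hypothetical function names, practical only for small networks; it returns one reduction if any exists.

```python
from itertools import combinations


def is_connected(nodes, links):
    nodes, links = set(nodes), list(links)
    for start in nodes:
        seen, stack = {start}, [start]
        while stack:
            x = stack.pop()
            for (u, v) in links:
                if u == x and v not in seen:
                    seen.add(v)
                    stack.append(v)
        if not nodes <= seen:
            return False
    return True


def connected_or_single(part, links):
    """Condition ii for the largest subnetwork defined on `part`."""
    if len(part) == 1:
        return True
    inner = [(u, v) for (u, v) in links if u in part and v in part]
    return is_connected(part, inner)


def reduction(nodes, links):
    """Return node sets (M1, M2) of a reduction of the network, or None."""
    nodes = list(nodes)
    for r in range(1, len(nodes)):
        for chosen in combinations(nodes, r):
            m1 = set(chosen)
            m2 = set(nodes) - m1
            forward = [(u, v) for (u, v) in links if u in m1 and v in m2]
            backward = [(u, v) for (u, v) in links if u in m2 and v in m1]
            if (len(forward) == 1 and len(backward) == 1
                    and connected_or_single(m1, links)
                    and connected_or_single(m2, links)):
                return m1, m2
    return None


single_arc = reduction([0, 1], {(0, 1), (1, 0)})
circuit3 = reduction([0, 1, 2], {(0, 1), (1, 2), (2, 0)})
arc_plus_circuit = reduction(range(4), {(0, 1), (1, 0), (1, 2), (2, 3), (3, 1)})
```

The oriented circuit on three nodes comes out irreducible: every splitting leaves a two-node part that is not connected.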


THEOREM 3.3. A connected network which is a graph is reducible if
and only if it is of degree 1.


Proof. It is clear that any reducible network is of degree 1, since we 
may disconnect it by removing either of the links joining the disjoint sub- 
networks. 


Let N be a graph of degree 1 and (ab) a link such that N − (ab) = N’
is disconnected. Evidently, in N’ there is no chain from a to b. Define M_b
to consist of b and any nodes b’ such that there is a chain from b’ to b in N’.
Let M_a = M − M_b. Clearly, a ∈ M_a. For any a’ ∈ M_a, a’ ≠ a, there exists
in N’ a chain from a to a’. If not, then, since N is connected, any chain in
N from a to a’ must contain the link (ab), and at least one such chain exists.
Since the node a can appear only once, this chain must be of the form
(aa’, q) = (ab)(ba’, q − 1), and (ba’, q − 1) does not contain a. But N is a
graph, so (ba’, q − 1) implies the existence of a chain (a’b, q − 1) which
does not contain (ab). This, then, is a chain in N’, and so a’ ∈ M_b, which is
contrary to choice. Thus we know a chain exists in N’ from a to a’. Since
N is a graph, this implies that the largest subnetworks of N defined on M_a
and M_b are each either connected or consist of a single node. Now, if a’ ∈ M_a
and b’ ∈ M_b, where a’ ≠ a or b’ ≠ b, then there exists no link of the form
(a’b’), for otherwise there would be a chain from a to b in N’. Since N is a
graph, it follows that there are no links of the form (b’a’). Thus N is
reducible.


⁴ König, ibid., first Betti number = Zusammenhangszahl, p. 53; Whitney, op. cit.,
first Betti number = nullity, p. 340.

The following is our principal result: 


THEOREM 3.4 (second decomposition theorem). To any minimal network
N, which is not a tree, there exist integers t ≥ 1 and y ≥ 0, such that N con-
sists of t disjoint irreducible compound circuits C_i, i = 1, 2, ..., t, and
y nodes C_{t+i}, i = 1, 2, ..., y, not included in the C_i, 1 ≤ i ≤ t, subject to
the conditions:

i. there exists at most one link from any C_i to any C_j, i ≠ j,
1 ≤ i, j ≤ t + y;

ii. no arc is contained in any of the C_i, 1 ≤ i ≤ t;

iii. the network formed by treating the C_i, 1 ≤ i ≤ t, as nodes, all other
nodes and links remaining unchanged, is a tree, or, if t = 1 and
y = 0, a single node.


Proof. This proof will be carried out in two stages. First we shall show
that if N is a minimal network in which there exists an arc ab, then N is
reducible into two subnetworks joined only by ab. Since N is minimal, it is
non-reflexive, so that a ≠ b. Define the set of nodes M_a to consist of a and
any other nodes, a’, of N such that there exists a chain from a to a’ which
does not include the link (ab). Let M_b = M − M_a. Then b ∈ M_b, for if not, then
b ∈ M_a, and so there exists (ab, q) not including (ab). Then N − (ab) is a
connected subnetwork of N; this violates the condition that N is minimal.


We shall now show some properties N must satisfy which will lead 
ultimately to a proof of the statement: 


1. If a’ ∈ M_a, a’ ≠ a, there exists a chain from a’ to a not including the
link (ba). Clearly some chain exists from a’ to a, since N is connected. If
all such chains include (ba), then, since the node a may only appear once,
each of them may be written in the form (a’a, q) = (a’b, q − 1)(ba). More-
over, (a’b, q − 1) does not include (ab) since a’ ≠ a. Now, by the definition
of M_a, there exists a chain (aa’, u) which does not include (ab), so that
(aa’, u)(a’b, q − 1) does not include (ab). This is contrary to the assumption
that N is minimal.




2. Let b’ ≠ b. b’ ∈ M_b if and only if there exists a chain from b to b’
which does not include the link (ba). Suppose first that b’ ∈ M_b. Since N
is connected, there exists at least one chain from b to b’. Suppose each
(bb’, q) contains (ba). Then, since each node may appear only once,
(bb’, q) = (ba)(ab’, q − 1). If (ab’, q − 1) does not contain (ab), then,
by definition, b’ ∈ M_a, which is impossible. But (ab’, q − 1) cannot contain
(ab), for if it did, then (bb’, q) would not be a chain. Hence (bb’, q) does
not contain (ba).

Conversely, suppose there exists a chain from b to b’ not including (ba).
If b’ ∉ M_b, there exists, by 1, a chain from b’ to a not including (ba); these
two combine into a chain from b to a not including (ba), which is contrary
to N being minimal.


3. If b’ ∈ M_b, b’ ≠ b, there exists (b’b, q) not including (ab). We may
parallel the proof of property 1 by replacing the words “definition of M_a”
by “property 2.”

4. If a’ ∈ M_a, b’ ∈ M_b, and either a ≠ a’ or b ≠ b’, then no link of the
form (a’b’) exists. Suppose such a link does indeed exist. Then, by the
definition of M_a, there exists a chain (aa’, q) which does not include (ab),
and, by property 3, a chain (b’b, r) which does not include (ab). Thus the
chain (aa’, q)(a’b’)(b’b, r) does not include (ab), since (a’b’) ≠ (ab), which
is impossible.

5. Under the same conditions as in 4, there is no link of the form (b’a’).
The argument is exactly parallel to that of 4, using properties 1 and 2.


It thus follows that the maximal subnetworks of N on the sets M_a and
M_b are each either connected and minimal, or consist of a single node. From
4 and 5, one concludes that the subnetworks are joined only by the arc ab.
This exhausts N, and the result is proved.
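The construction of M_a and M_b in this first stage is easy to carry out on an example. The Python sketch below is an illustration only, with hypothetical function names; M_a is found as the set of nodes reachable from a without using the link (ab), which gives the same set as the chains of the definition because any walk avoiding (ab) contains a chain avoiding (ab).

```python
def reach_without(links, start, banned):
    """Nodes reachable from `start` along links other than `banned`."""
    seen, stack = {start}, [start]
    while stack:
        x = stack.pop()
        for (u, v) in links:
            if u == x and (u, v) != banned and v not in seen:
                seen.add(v)
                stack.append(v)
    return seen


def split_on_arc(nodes, links, a, b):
    """Return (M_a, M_b, crossing links) for the arc ab of a minimal network."""
    m_a = reach_without(links, a, (a, b))
    m_b = set(nodes) - m_a
    crossing = {(u, v) for (u, v) in links if (u in m_a) != (v in m_a)}
    return m_a, m_b, crossing


# A minimal network: the arc between 0 and 1 together with the circuit 1 -> 2 -> 3 -> 1.
links = {(0, 1), (1, 0), (1, 2), (2, 3), (3, 1)}
m_a, m_b, crossing = split_on_arc(range(4), links, 1, 0)
```

Here the two parts are joined only by the links of the arc, as properties 4 and 5 require.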

Since an arbitrary network has a finite number of arcs, it follows from
a finite number of applications of the above result that a minimal network
which is not a tree consists of a set of t’ ≥ 1 disjoint arc-free minimal sub-
networks C_i, i = 1, 2, ..., t’, and y’ ≥ 0 nodes C_{t’+i}, i = 1, 2, ..., y’, not
included in the C_i, 1 ≤ i ≤ t’, such that:

i. any link not in a C_i, 1 ≤ i ≤ t’, is a member of an arc;

ii. there exists at most one arc between any C_i and C_j, i ≠ j,
1 ≤ i, j ≤ t’ + y’;

iii. the network formed by treating the C_i, 1 ≤ i ≤ t’, as nodes, is a tree;

the decomposition is unique.




By virtue of this decomposition, the problem is reduced to examining the
case of an arc-free minimal network. We show: An arc-free minimal network
consists of t” ≥ 1 disjoint irreducible compound circuits C_i, i = 1, 2, ..., t”,
and y” ≥ 0 nodes C_{t”+i}, i = 1, 2, ..., y”, not included in the C_i, 1 ≤ i ≤ t”,
such that:

i. there exists at most one link from any C_i to any C_j, i ≠ j,
1 ≤ i, j ≤ t” + y”;

ii. the network formed by treating the C_i, 1 ≤ i ≤ t”, as nodes is a
tree, or, if t” = 1 and y” = 0, a single node.


The first arc-free minimal network occurs for m = 3, and this obviously
satisfies the conditions since it is a circuit on three nodes. Suppose now that
the statement, except for the condition that the compound circuits are irre-
ducible, has been proved for all networks through m − 1 nodes, and let
N(m, p) be an arc-free minimal network. In N there exists a circuit con-
sisting of at least three links, since N is connected, arc-free, and non-reflexive;
let C be one such circuit on the nodes c_i, i = 1, 2, ..., q ≥ 3. The maximum
subnetwork of N on these nodes is only C, for if there exists any other link
(c_i c_k), k > i + 1, then this link can be removed without disconnecting N,
since the chain (c_i c_{i+1})(c_{i+1} c_{i+2}) ··· (c_{k−1} c_k) exists. This is impossible since
N is minimal. Now, if C exhausts all the nodes of N we are done. If not,
let M’ be the set of nodes remaining. If a ∈ M’, and (ac_k), 1 ≤ k ≤ q, exists,
then no link of the form (ac_i), 1 ≤ i ≤ q, i ≠ k, exists. For if so, then the
chain from c_k to c_i, which is a part of C, and so does not include (ac_i), shows
that (ac_i) may be removed without disconnecting N. This is impossible.
Similarly, if (c_k a), 1 ≤ k ≤ q, exists, then (c_i a), 1 ≤ i ≤ q, i ≠ k, does not
exist.

Now consider the network N’ formed by letting the nodes of C coalesce
into a single node which we shall call c. Evidently, since N is minimal, so is
N’, and N’ has at least two nodes fewer than N, since q ≥ 3. Several possi-
bilities exist for N’. First, it may be a graph, and hence a tree, in which
case the statement is proved. Second, it is not a tree, but there exists at
least one arc. By the first part of this theorem, N’ may be decomposed into
several arc-free minimal subnetworks connected in such a fashion that if they
are treated as nodes, the resulting network is a tree. By the induction hypo-
thesis, these arc-free minimal subnetworks satisfy the conditions of the
statement we are proving. But replacing the node c by C, reconstructing N,
only increases the order of one of these compound circuits, or introduces a
new compound circuit, so the result is true for N. Third, N’ is arc-free, in
which case the induction hypothesis may be applied directly, and the intro-
duction of C for c only increases the order of the compound circuit.
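The coalescing step used in this induction can be sketched as follows. The Python code is an illustration only; the function name `coalesce` and the label "c" for the new node are assumptions made here.

```python
def coalesce(links, circuit_nodes, c):
    """Let the nodes of a circuit C coalesce into the single node c.
    Links inside C disappear; links entering or leaving C are re-attached to c."""
    group = set(circuit_nodes)
    result = set()
    for (u, v) in links:
        if u in group and v in group:
            continue  # a link of C itself
        result.add((c if u in group else u, c if v in group else v))
    return result


# The circuit 1 -> 2 -> 3 -> 1 hanging from node 0 by the links (01) and (10).
links = {(0, 1), (1, 0), (1, 2), (2, 3), (3, 1)}
coalesced = coalesce(links, [1, 2, 3], "c")
```

In this small case the coalesced network consists of a single arc between 0 and c, which is the first of the possibilities distinguished above.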

Thus, we may decompose N into several compound circuits and nodes not
in these compound circuits and connecting links satisfying the conditions i
and ii of the second intermediate statement. Carry this decomposition as far
as possible; the process will terminate in a finite number of steps, since N is
finite. We will show that the resulting compound circuits are irreducible.
For suppose that C_j is reducible into the disjoint subnetworks A and B
connected by the links (ab), (b’a’), a, a’ ∈ A, b, b’ ∈ B. By condition ii it
follows that any C_i, 1 ≤ i ≤ t” + y”, i ≠ j, is linked “symmetrically” to C_j,
if at all. In fact, it is either linked symmetrically to A or to B; for if not,
then there exists a link from A to C_i and a link from C_i to B, in which case
(ab) may be removed, or, in the other case, (b’a’) may be removed without
disconnecting N. This is impossible. A and B are either compound circuits
or, by the result proved for arc-free minimal networks, may be reduced to
several compound circuits and nodes not in them such that i and ii hold.
By an argument similar to the one just made, the conditions i and ii hold
for N with this finer decomposition. This is contrary to choice, so C_j must
be irreducible.

The proof of the theorem follows almost immediately from the two inter- 
mediate results, if we note that the last argument may be applied to show 
condition iii.

This decomposition of a minimal network is not unique, for 


000001 


010100 
000010 
001000 
100010


may be decomposed into either a tree consisting of one arc, or one of two arcs. 

The next result gives a little more information about the components 
into which we have decomposed a minimal network, the irreducible compound 
circuits. This result is unsatisfactory in the sense that it does not give a 
complete characterization of these networks. For this proof and succeeding 
results we need the following definition. A node is simple if it is the initial 
node of exactly one link and the end node of exactly one link. 


THEOREM 3.5. Let N be a minimal network. N is irreducible if and
only if it is a compound circuit such that in any inductive composition of N,
none of the circuits introduced are arcs.


Proof. Suppose that at some stage of the composition of N, an arc ab is
introduced into a compound circuit C to form a compound circuit C’. If ab
does not have a simple node, then in C’ there exists either a chain from a to b
not containing (ab), or one from b to a not containing (ba), since such a
chain exists in C. The introduction of further circuits can only lengthen
this chain, so N − (ab) is a complete connected subnetwork, which is
impossible. Hence a node of ab is simple. The introduction of further circuits
merely adds to C to form a larger compound circuit, and hence a connected
subnetwork or a single node of N, or it may replace the simple node of ab
by a compound circuit. Between these two connected subnetworks, or single
nodes, are only the links arising from ab, now no longer an arc in general.
Thus N is reducible, which is contrary to assumption, proving that no arc
can be introduced.


Conversely, if we suppose N is reducible, then Theorem 3.4 implies N
may be decomposed into one or more irreducible compound circuits and nodes
not included in these compound circuits. The circuits of any compound
circuit C may be coalesced into nodes in the inverse order of an inductive
composition of C. This clearly leads to the tree of Theorem 3.4. But any
tree is a compound circuit formed only of arcs. Thus we have an inductive
composition of N involving arcs, the arcs of the tree. As this is contrary to
assumption, N must be irreducible.

The principal theorem will be utilized sometimes through two properties 
of minimal networks derivable from it. They are presented as 


THEOREM 3.6. A minimal network is a compound circuit which contains
at least two simple nodes.

Proof. The last part of the above proof suffices to show that a minimal
network is a compound circuit.


To show that a minimal network has two simple nodes, we shall perform
an induction on the order s of the compound circuit. It is certainly true for
s = 1, since the compound circuit is then a non-reflexive circuit. Assume the
result true up through circuits of order s − 1. Suppose N is a minimal
compound circuit of order s, and let C be the last circuit introduced in some
inductive composition of N. Let C coalesce into a single node c, and call the
resulting network N’. N’ is readily seen to be minimal and of order s − 1,
so by the induction hypothesis it contains at least two simple nodes. If two
of these are different from c, then we are done. If not, c is simple. Consider
N; if C is not an arc then it must introduce a simple node, for C has at least
three nodes, and there exists only one link to C from the rest of the nodes,
and only one from C, since c is simple. If, on the other hand, C is an arc,
then the first argument in the proof of Theorem 3.5 shows that one of its
nodes is simple, and the result follows.

That not every compound circuit is minimal or has a simple node is
shown by:

\[
\begin{pmatrix}
0 & 1 & 0 & 1 \\
0 & 0 & 1 & 0 \\
0 & 1 & 0 & 1 \\
1 & 0 & 0 & 0
\end{pmatrix}
\]
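The two negative claims about this four-node example can be checked mechanically. The Python sketch below is not part of the paper. It assumes that "connected" means a chain runs from every node to every other node, and that a network is minimal when it is connected and no single link can be removed without disconnecting it, as the earlier sections appear to define these terms. All function names are hypothetical.

```python
# Relation matrix of the four-node example: entry [a][b] == 1 means a link (ab).
EXAMPLE = [
    [0, 1, 0, 1],
    [0, 0, 1, 0],
    [0, 1, 0, 1],
    [1, 0, 0, 0],
]


def reachable(matrix, start):
    """Nodes reachable from `start` along directed links (start included)."""
    seen, stack = {start}, [start]
    while stack:
        a = stack.pop()
        for b, entry in enumerate(matrix[a]):
            if entry and b not in seen:
                seen.add(b)
                stack.append(b)
    return seen


def is_connected(matrix):
    """Assumed meaning of 'connected': every node reaches every other node."""
    n = len(matrix)
    return all(len(reachable(matrix, a)) == n for a in range(n))


def simple_nodes(matrix):
    """Nodes that start exactly one link and end exactly one link."""
    n = len(matrix)
    out_deg = [sum(row) for row in matrix]
    in_deg = [sum(matrix[a][b] for a in range(n)) for b in range(n)]
    return [v for v in range(n) if out_deg[v] == 1 and in_deg[v] == 1]


def removable_links(matrix):
    """Links (numbered from 1) whose removal leaves the network connected."""
    found = []
    for a, row in enumerate(matrix):
        for b, entry in enumerate(row):
            if entry:
                trimmed = [r[:] for r in matrix]
                trimmed[a][b] = 0
                if is_connected(trimmed):
                    found.append((a + 1, b + 1))
    return found


def is_minimal(matrix):
    """Assumed meaning of 'minimal': connected, and every link is essential."""
    return is_connected(matrix) and not removable_links(matrix)
```

Under these assumptions the example is connected, has no simple node, and is not minimal, because some links can be removed without disconnecting it.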


4. Applications to connected networks. Two applications to connected
networks are given of the results on minimal networks; the first examines
limits on the number of links a connected network may have, and the second
discusses the maximum number of “independent” circuits a connected
network may have.


THEOREM 4.1. Let N(m,p) be a connected network, not a tree. Let
N have a descendant N’ which is decomposable in the terms of Theorem 3.4
into t irreducible minimal subnetworks and y nodes not in these subnetworks.
Then

\[
p \le (3m + t + y - 4)/2 + p(N - N') < 2(m - 1) + p(N - N').
\]

If N is a tree, p = 2(m − 1).

Proof. If N is a tree, the result is well known from graph theory.⁵


⁵ Whitney, ibid., pp. 340-341.

Suppose N is not a tree. Then it is sufficient to show the result for the
class of minimal networks which are not trees, since, in the general case, the
network N has p(N − N’) more links than any descendant N’. By Lemma
2.3 a descendant is minimal, and, by Lemma 3.1, it is not a tree. So we
consider N minimal. Decompose N as in Theorem 3.4, and let the irreducible
compound circuits C_i have m_i nodes, p_i links, and order s_i. Let there be p’
links not in any irreducible compound circuit. Then, by the result on trees,
p’ = 2(t + y − 1). For each of the irreducible minimal subnetworks, Theorem
3.5 implies that each of the s_i circuits used in forming C_i has at least three
nodes, so that m_i ≥ 3 + 2 + 2 + ⋯ + 2 = 2s_i + 1. By Lemma 3.2,

\[
p_i \le 3(m_i - 1)/2.
\]

Thus,

\[
p = \sum p_i + p' \le \sum 3(m_i - 1)/2 + 2(t + y - 1)
  = (3/2)\Bigl(\sum m_i + y\Bigr) + (t + y - 4)/2 = (3m + t + y - 4)/2.
\]

This may be simplified a little by noting that each of the irreducible minimal
subnetworks must have at least three nodes, so m ≥ 3t + y; hence,

\[
p \le (3m + t + y - 4)/2 = (4m - 4 - 2t + 3t + y - m)/2
  \le 2(m - 1) - t < 2(m - 1).
\]

This concludes the proof.
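A small numerical illustration of the two bounds may help. The sketch below is not from the paper. The helper names and the parameter `extra_links` (standing for p(N − N’)) are hypothetical, and both sample networks are assumed to satisfy the hypotheses of Theorem 4.1, namely a single circuit on three nodes and a circuit on four nodes to which two further links have been added.

```python
from fractions import Fraction


def refined_bound(m, t, y, extra_links=0):
    """(3m + t + y - 4)/2 + p(N - N'); `extra_links` stands for p(N - N')."""
    return Fraction(3 * m + t + y - 4, 2) + extra_links


def coarse_bound(m, extra_links=0):
    """2(m - 1) + p(N - N')."""
    return 2 * (m - 1) + extra_links


# Single circuit on three nodes: m = 3, p = 3, t = 1, y = 0 (already minimal).
print(refined_bound(3, 1, 0), coarse_bound(3))

# Four-node circuit plus two extra links: m = 4, p = 6, t = 1, y = 0, p(N - N') = 2.
print(refined_bound(4, 1, 0, extra_links=2), coarse_bound(4, extra_links=2))
```

In the first case the refined bound equals the number of links, so it cannot be improved for that network. In both cases it lies strictly below the coarse bound.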
It is clear that in a given network we may define the addition of chains
(mod 2). Thus we may also define linear independence (mod 2). We shall
be concerned with sets of linearly independent (mod 2) circuits such that no
other linearly independent set contains a greater number of circuits. These
sets will be called maximal. The result proved in the next theorem is, in
statement, formally the same as a result of graph theory:⁶

THEOREM 4.2. In any connected network N(m, p) there exists a maximal
set of p − m + 1 linearly independent (mod 2) circuits.

Proof. First, it is sufficient to show this for minimal networks. For,
if N is not minimal, then it has a descendant N’ which is. N may be
considered to be formed from N’ by the addition of links one at a time. Each
such link adds at least one new circuit which is independent (mod 2) of the
circuits of the network to which it was added, since in a connected network
every link is contained in at least one circuit. Thus, if there exists a set U’
of p − K − m + 1, K = p(N − N’), linearly independent (mod 2) circuits
in N’, there will exist a set U of at least p − m + 1 linearly independent
circuits in N.

⁶ Lefschetz, Solomon, Introduction to Topology, Princeton, Princeton University
Press (1949), p. 71.

Furthermore, if U’ is maximal in the descendant, U will be in N also.
If not, then there is a first subnetwork, N*, for which any set of p* − m + 1
linearly independent (mod 2) circuits is maximal, and to which the addition
of a link (ab) produces a linearly independent set U’’ having more than
p* − m + 2 circuits. It is clear that this set U’’ must contain at least two
circuits which include the link (ab), for otherwise the subset of U’’ in N*
would contain more than p* − m + 1 linearly independent circuits. Let two of
the circuits be denoted by (ab)(ba, q) and (ab)(ba, q’). Since N* is
connected and does not contain (ab), there exists a chain from a to b not
including (ab); select a shortest: (ab, q’’), q’’ > 1. In general, (ab, q’’)
will coincide with (ba, q) over a certain number of links, i.e., over a set of
several chains of the form (cd, t), each a part of (ba, q). The argument
does not change in principle, and a great saving in notation is gained, if we
assume that at most one such chain occurs. Similarly, (ab, q’’) will be
assumed to coincide with (ba, q’) over the chain (c’d’, t’). Furthermore, we
shall assume that (cd, t) and (c’d’, t’) have no links in common; if they do,
a slight modification of the following argument will suffice. So we may write:


(ab, q’’) = (ac, u)(cd, t)(dc’, z)(c’d’, t’)(d’b, v),
the order of (cd, t) and (c’d’, t’) being immaterial.
(ba, q) = (bc, x)(cd, t)(da, y), (ba, q’) = (bc’, x’)(c’d’, t’)(d’a, y’).
Observe that the following formal products are in fact circuits of N*:
A: (ac, u)(cd, t)(da, y)
B: (d’b, v)(bc’, x’)(c’d’, t’)
C: (ac, u)(cd, t)(dc’, z)(c’d’, t’)(d’a, y’)
D: (bc, x)(cd, t)(dc’, z)(c’d’, t’)(d’b, v).


Since these are circuits of N*, they are expressible (mod 2) in terms of the
circuits in U’. But observe that (ab)(ba, q) + A + B + C + D = (ab)(ba, q’)
(mod 2) so that (ab)(ba, q) and (ab)(ba, q’) are not linearly independent.
Thus only one of them can be in U’’, and so we have shown that if the theorem
is true for minimal networks it is true in general.
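The mod 2 identity used in this step can be verified by treating each chain as a set of labelled segments and addition (mod 2) as symmetric difference. The sketch below is illustrative only. The string labels are hypothetical names for the chains (ac, u), (cd, t) and so on, and the segments are assumed to have no links in common, as the proof itself assumes.

```python
# Each chain segment is one opaque label; a formal product is a set of labels.
ab = {"ab"}
ba_q = {"bc", "cd", "da"}            # (ba, q)  = (bc, x)(cd, t)(da, y)
ba_q_prime = {"bc'", "c'd'", "d'a"}  # (ba, q') = (bc', x')(c'd', t')(d'a, y')

A = {"ac", "cd", "da"}
B = {"d'b", "bc'", "c'd'"}
C = {"ac", "cd", "dc'", "c'd'", "d'a"}
D = {"bc", "cd", "dc'", "c'd'", "d'b"}

# Addition (mod 2) of chains is symmetric difference of their segment sets.
left_side = (ab | ba_q) ^ A ^ B ^ C ^ D
right_side = ab | ba_q_prime

print(left_side == right_side)
```

Every segment other than those of (ab)(ba, q’) occurs an even number of times on the left, which is why the sum collapses to the right-hand circuit.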

The minimal case will be proved by induction on m. For m= 2 it is 
trivially true. Suppose it is true for all minimal networks having m —1 
or fewer nodes, and let N(m,p) be minimal. By Theorem 3.6, N has a 
simple node a which is the initial node of only one link, (ab), and the end 
node of only one, (ca). We may distinguish three cases: 


i. b = c. Remove a and the arc ab, leaving the subnetwork N’(m − 1,
    p − 2). N’ is obviously minimal, and so it has a maximal set of
    p − m linearly independent (mod 2) circuits. The arc ab adds
    exactly one circuit to this set.


ii. b ≠ c, and there does not exist (cb, q) ≠ (ca)(ab). Remove a and
    the adjoining links and introduce the link (cb) to form N’(m − 1,
    p − 1), which is minimal. Thus, by the induction hypothesis, it has
    a maximal set of p − m + 1 linearly independent circuits. But
    forming N’ from N cannot essentially change any set of linearly
    independent circuits, since the chain (ca)(ab) is, in this case,
    formally the same as (cb).

iii. b ≠ c, and there does exist (cb, q) ≠ (ca)(ab). Again remove a
    and the adjoining links to form N’(m − 1, p − 2), which is minimal.
    By the induction hypothesis, N’ has a maximal set of p − m linearly
    independent circuits. Replacing a and the two links (ab) and (ca)
    to form N adds one or more new circuits, depending on the number
    of chains from b to c. This situation is not essentially different
    from the one discussed in the first part of this proof, except that
    we are adding a 2-chain and a new node, rather than a single link.
    Since this node is simple, the argument is formally the same, and
    it shows that there is a maximal set of p − m + 1 linearly
    independent (mod 2) circuits in N. This, then, concludes the proof.


We note the trivial corollary: A connected network N(m, p) has exactly
p − m + 1 circuits if and only if the set of all circuits of N is linearly
independent (mod 2).
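The count p − m + 1 can be checked on a small case by listing the circuits of a network and computing the rank of their link sets over the field with two elements. The sketch below is not the paper's method. It assumes that a circuit is a closed chain with no repeated node, it uses the four-node example given after Theorem 3.6, and its function names are hypothetical.

```python
def circuits(matrix):
    """Closed chains with no repeated node, each listed once as a node tuple."""
    n = len(matrix)
    found = []

    def extend(start, path):
        # Only nodes >= start are visited, so each circuit is found exactly once.
        for nxt in range(start, n):
            if not matrix[path[-1]][nxt]:
                continue
            if nxt == start:
                found.append(tuple(path))
            elif nxt not in path:
                path.append(nxt)
                extend(start, path)
                path.pop()

    for start in range(n):
        extend(start, [start])
    return found


def rank_mod2(vectors):
    """Rank over GF(2) of integers read as bit vectors."""
    basis = {}  # highest set bit -> reduced vector
    for v in vectors:
        while v:
            top = v.bit_length() - 1
            if top not in basis:
                basis[top] = v
                break
            v ^= basis[top]
    return len(basis)


def circuit_rank_mod2(matrix):
    """Largest number of circuits that are linearly independent (mod 2)."""
    n = len(matrix)
    bit = {}
    for a in range(n):
        for b in range(n):
            if matrix[a][b]:
                bit[(a, b)] = 1 << len(bit)
    vectors = []
    for path in circuits(matrix):
        links = zip(path, path[1:] + path[:1])
        vectors.append(sum(bit[link] for link in links))
    return rank_mod2(vectors)


EXAMPLE = [
    [0, 1, 0, 1],
    [0, 0, 1, 0],
    [0, 1, 0, 1],
    [1, 0, 0, 0],
]
m = len(EXAMPLE)
p = sum(map(sum, EXAMPLE))
print(len(circuits(EXAMPLE)), circuit_rank_mod2(EXAMPLE), p - m + 1)
```

For this network all three quantities agree, so by the corollary the set of all its circuits is linearly independent (mod 2).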


5. Generalizations of a tree. We shall show in this section that several 
of our definitions, when applied to networks which are graphs, are identical 
with the concept of a tree. This can be shown directly and easily in each case; 
however we shall first prove two results which are true in general, and then 
we shall use them to prove Theorem 5.3. Thus, that result is not as deep as 
it first appears to be. 

We shall call a connected network N(m, p) having exactly p − m + 1
circuits circuit minimal. This definition makes sense because of Theorem 4.2.
By the corollary to that theorem, N is circuit minimal if and only if N is
connected and the set of all circuits is linearly independent (mod 2).


Lemma 5.1. A circuit minimal network is uniform. 


Proof. Let N(m, p) be circuit minimal. Let (ab) be any link, and
N − (ab) = N’. If N’ is not connected, then N has degree 1. If N’ is
connected it is circuit minimal, for at least one circuit of N was destroyed
by the removal of (ab), and according to Theorem 4.2, no more than one.
Since N’ is connected, there exists at least one chain from b to a, but only
one, for if there were more then the addition of the single link (ab) would
introduce more than one circuit, and N would have more than p − m + 1
circuits. In N, interrupt the chain from b to a by removing a single link
from it, thus disconnecting N. This proves N is of degree 1.


To show N is uniform it will thus suffice to show that every connected
subnetwork is circuit minimal. If S is a connected network which is not
circuit minimal, then there exists a circuit in S which is linearly dependent
(mod 2) on the other circuits of S. This remains true in N, so N is not
circuit minimal, a contradiction.


Lemma 5.2. A compound circuit is uniform.


Proof. This may be demonstrated by an induction on the order of
compound circuits. It is trivially true for compound circuits of order 1. Let
N be a compound circuit of order s > 1, let C be the last circuit introduced
in some inductive composition of N, and let S be any subnetwork of N.
Coalesce C into a single node c, and under this operation let S become S’.
If S’ is a single node, then S = C, and the degree of S is 1. Otherwise, S’ is
connected, and therefore, by the induction hypothesis, it is of degree 1. Select
(ab) ∈ S, ∉ C, such that S’ − (ab) is not connected. This is possible since S’
is of degree 1. Now the introduction in S’ of that part of C in S, subject to
the conditions of N, can only replace the node c by a chain or a circuit, but
cannot introduce a link or a chain from a to b; thus S is of degree 1. So N
is uniform.

The next result is the justification for the title of this section.


THEOREM 5.3. For a connected network N which is a graph, the 
following are equivalent: i. N is a tree, ii. N is minimal, iii. N is a compound 
circuit, iv. N is uniform, v. N is circuit minimal. 


Proof. i. implies ii trivially. ii. implies iii by Theorem 3.6. iii. implies
iv by Lemma 5.2. iv. implies i. For if N is not a tree, then there exists a
circuit in the sense of graph theory. But this is clearly a connected
subnetwork of degree 2, so that N is not uniform. v. implies iv by Lemma 5.1.
i. implies v. If N(m, p) is a tree, it follows, from theorem 3.5, that
p − (m − 1) = m − 1. Furthermore, the only circuits in a tree are 2-circuits
(arcs) of which there are exactly m − 1, so the number of circuits is
p − m + 1.


The several results of this section and Theorem 3.6 suggest the following
class of unsolved problems: Conditions on a uniform network that it be a
compound circuit. Conditions on a uniform network that it be circuit minimal.
Conditions on a compound circuit that it be minimal. These four concepts
are indeed all distinct. The network


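\[
\begin{pmatrix}
0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 1 \\
0 & 0 & 0 & 1 & 0 & 0 \\
1 & 0 & 0 & 0 & 1 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0
\end{pmatrix}
\]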


is minimal, and hence a compound circuit, but not circuit minimal. The 
network 

\[
\begin{pmatrix}
0 & 1 & 0 & 1 \\
1 & 0 & 0 & 0 \\
1 & 1 & 0 & 1 \\
1 & 0 & 1 & 0
\end{pmatrix}
\]


is both uniform and circuit minimal, but not a compound circuit. The network 


1010 
1001 
1000 


is uniform, but neither a compound circuit nor circuit minimal. The example 
following Theorem 3.6 shows that not every compound circuit is minimal. 
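The statements about circuit minimality for the first two examples of this section can be tested by counting circuits and comparing with p − m + 1. The sketch below is illustrative and not part of the paper. It assumes that a circuit is a closed chain with no repeated node and that "connected" means every node can be reached from every other. The names `SIX_NODE` and `FOUR_NODE` and the function names are hypothetical.

```python
SIX_NODE = [
    [0, 1, 0, 0, 0, 0],
    [0, 0, 1, 0, 0, 1],
    [0, 0, 0, 1, 0, 0],
    [1, 0, 0, 0, 1, 0],
    [0, 1, 0, 0, 0, 0],
    [0, 0, 0, 1, 0, 0],
]
FOUR_NODE = [
    [0, 1, 0, 1],
    [1, 0, 0, 0],
    [1, 1, 0, 1],
    [1, 0, 1, 0],
]


def count_circuits(matrix):
    """Number of closed chains with no repeated node."""
    n = len(matrix)
    total = 0

    def extend(start, path):
        nonlocal total
        # Visiting only nodes >= start counts each circuit exactly once.
        for nxt in range(start, n):
            if not matrix[path[-1]][nxt]:
                continue
            if nxt == start:
                total += 1
            elif nxt not in path:
                path.append(nxt)
                extend(start, path)
                path.pop()

    for start in range(n):
        extend(start, [start])
    return total


def is_connected(matrix):
    """Assumed meaning of 'connected': every node reaches every other node."""
    n = len(matrix)
    for start in range(n):
        seen, stack = {start}, [start]
        while stack:
            a = stack.pop()
            for b in range(n):
                if matrix[a][b] and b not in seen:
                    seen.add(b)
                    stack.append(b)
        if len(seen) != n:
            return False
    return True


def is_circuit_minimal(matrix):
    """Connected with exactly p - m + 1 circuits."""
    m = len(matrix)
    p = sum(map(sum, matrix))
    return is_connected(matrix) and count_circuits(matrix) == p - m + 1


print(is_circuit_minimal(SIX_NODE), is_circuit_minimal(FOUR_NODE))
```

Under these assumptions the six-node network has one circuit more than p − m + 1, while the four-node network has exactly p − m + 1, in agreement with the text.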


6. Rank minimal networks. In section 1 we noted the representation 
of networks by relation matrices with entries from the two-element Boolean 
algebra. Equally well, we may interpret this as a representation by real 
matrices with the numbers 0 and 1 as entries. Thus, since it is well known 
that matrix rank is a similarity invariant, to each network there is a uniquely 
defined number r, 1 ≤ r ≤ m, called the rank of the network, which is the
rank of any of the corresponding real matrix representations. 


THEOREM 6.1. If N(m, p) is a connected network⁷ having rank r, then
p + r ≥ 2m.

⁷ Luce, R. D., “Connectivity and generalized cliques in sociometric group
structures,” Psychometrika, vol. 15 (1950), pp. 169-190. In this paper the
diameter, n, of a connected network was defined as
n = max_{a,b ∈ M} min_q (ab, q), and it was conjectured that
p + n ≥ 2m. This is now known to be false; however, Theorem 6.1 is a correct result
which is closely related to the conjecture, for it may also be shown that r ≥ n.


Proof. Suppose p + r < 2m. Select any set R of r linearly independent
rows in a particular matrix representation. Since N is connected, there exists
a non-zero entry in each column j; but since each row can be written as a
linear combination of rows from R, it follows that for each column j there
exists an i ∈ R, such that the ij entry is 1. Thus, in the rows of R there
are at least m 1’s. By our assumption, there remain p’ = p − m < m − r
links (entries that are 1). Each of the m − r rows not in R must
contain a non-zero entry, since N is connected, and therefore p’ ≥ m − r, a
contradiction.


We shall call a network N(m,p) rank minimal if it is connected, and 
p+r=2m. 


THEOREM 6.2. If a connected network is rank minimal, it is minimal.


Proof. As in the proof of Theorem 6.1, we consider a matrix
representation N of the rank minimal network N, and let R be a set of r linearly
independent rows. Each column has a non-zero entry in some row of R,
since N is connected. The set R’ of the m − r remaining rows must have
a non-zero entry in each row for the same reason. However, since p = 2m − r,
it is necessary that R have exactly one non-zero entry in each column, and R’
exactly one in each row.


Let (ab) be any link of the network. We shall show that its removal 
results in a disconnected network, which will prove the theorem. 

If a ∈ R’, then the removal of (ab) results in a network N’ having no
link for which a is the initial node, since the rows of R’ have exactly one
non-zero entry.

If a ∈ R, then either the row a has only one non-zero entry, and we use
the above argument, or it has another non-zero entry, say in column c, c ≠ b.
We show that in the latter case column b has only the one non-zero entry, N_ab.
For suppose another link (db), d ≠ a, exists. Then d ∈ R’, for we showed
above, essentially, that the rows of R have exactly one non-zero entry in each
column, and we have assumed (ab) to exist and a ∈ R. Since the rows of R’
have exactly one non-zero entry, it follows that N_db is the one for row d.
But since the rows of R are a set of linearly independent ones for this matrix,
row d must be a linear combination of rows of R. The row a must be in this
combination, as it is the only one of R having an entry in the b column.
However, we assumed that row a has a non-zero entry in column c. This
must be subtracted, since row d cannot have an entry N_dc = 1. But this is
impossible using only rows of R, since no other row of R has an entry in the
c column. This contradiction implies that column b has only one 1, and
so N − (ab) is a complete disconnected subnetwork of N. Thus N is minimal.

The converse statement is not true, as will be obvious from a comparison
of Theorem 6.4 and Theorem 3.4.

The next lemma will be used in conjunction with Theorem 3.4 to decompose
any rank minimal network.


Lemma 6.3. Let N be rank minimal. If N is reducible into the
subnetworks N_1 and N_2, then either N_1 or N_2 is a single node.


Proof. Let the N_i, i = 1, 2, have m_i nodes, p_i links, and rank r_i; and
let m, p, and r denote the corresponding quantities in N. If neither of the
N_i is a node, they are both connected subnetworks, so by Theorem 6.1,
p_i ≥ 2m_i − r_i. It is evident from the definition of a reducible network that
p = p_1 + p_2 + 2, and m = m_1 + m_2. Furthermore, if we let the matrix minor
representation of N_i be denoted by the same symbols, we then have, for an
appropriate labeling of the nodes, the following type of matrix representation
for N:

\[
\begin{pmatrix}
N_1 & 0 \cdots 1 \cdots 0 \\
0 \cdots 1 \cdots 0 & N_2
\end{pmatrix}
\]

whence one sees that r ≥ r_1 + r_2 − 1. Thus, p = p_1 + p_2 + 2 ≥ 2m_1 − r_1
+ 2m_2 − r_2 + 2 = 2(m_1 + m_2) − (r_1 + r_2 − 1) + 1 > 2m − r, which is
contrary to the assumption that N is rank minimal.


A tree such that the arcs all have one end node in common is called a 
star. It is not difficult to show that a star is rank minimal. 
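The rank condition is easy to experiment with. The sketch below is not from the paper. It computes the rank of a 0-1 relation matrix over the rationals by exact elimination and reports p + r − 2m, which Theorem 6.1 says is non-negative for a connected network and which is zero exactly when the network is rank minimal. The helper `star`, which builds a star with node 0 at the centre, and the other function names are hypothetical.

```python
from fractions import Fraction


def real_rank(matrix):
    """Rank of a matrix over the rationals, by exact Gaussian elimination."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    rank = 0
    for col in range(len(rows[0])):
        pivot = next((r for r in range(rank, len(rows)) if rows[r][col] != 0), None)
        if pivot is None:
            continue
        rows[rank], rows[pivot] = rows[pivot], rows[rank]
        for r in range(rank + 1, len(rows)):
            factor = rows[r][col] / rows[rank][col]
            if factor:
                rows[r] = [x - factor * y for x, y in zip(rows[r], rows[rank])]
        rank += 1
    return rank


def star(m):
    """Star on m nodes: node 0 is joined to every other node by an arc."""
    matrix = [[0] * m for _ in range(m)]
    for leaf in range(1, m):
        matrix[0][leaf] = 1
        matrix[leaf][0] = 1
    return matrix


def rank_excess(matrix):
    """p + r - 2m for the network with the given relation matrix."""
    m = len(matrix)
    p = sum(map(sum, matrix))
    return p + real_rank(matrix) - 2 * m


print(real_rank(star(5)), rank_excess(star(5)))
```

For a star all rows other than the centre row coincide, so the rank is 2 while p = 2(m − 1), which gives p + r = 2m.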




THEOREM 6.4. Let N be a rank minimal network which is not a star.
If N has any arcs, there exists one arc-free rank minimal subnetwork C of N,
such that the network formed from N by treating C as a node, all other
links and arcs remaining unchanged, is a star. An arc-free rank minimal
network C consists of exactly one irreducible rank minimal subnetwork C’,
not a single node, and, possibly, some simple nodes such that the network
formed from C by treating C’ as a node, all other links and nodes remaining
unchanged, is a star. Furthermore, if C’ is the irreducible rank minimal
subnetwork of the arc-free rank minimal subnetwork C of N, then the
network formed from N by treating C’ as a node, all other links and nodes
remaining unchanged, is a star.


Proof. In the first case, apply Lemma 6.3 to the first statement
presented in the proof of Theorem 3.4 to show that each arc which exists must
have a simple node. The arc-free subnetwork C is rank minimal, for the
removal of any arc from N results in a network N’ for which p’ = p − 2,
m’ = m − 1, and r’ ≤ r, implying p’ ≤ 2m’ − r’. Thus, by Theorem 6.1,
N’ is rank minimal. To C, first apply Theorem 3.4 and then Lemma 6.3
to show any nodes, not in an irreducible subnetwork, must be simple, and
there is only one irreducible subnetwork C’. The same argument as applied
above suffices to show that C’ is rank minimal. The final statement is proved
in the same manner.


In conclusion one may mention two more unsolved problems: Conditions
on a minimal network that it be rank minimal, and a characterization of an
irreducible rank minimal network.


MASSACHUSETTS INSTITUTE OF TECHNOLOGY. 




ON THE NON-VANISHING OF CERTAIN DIRICHLET SERIES.* 


By AUREL WINTNER. 


Let f(n), where n = 1, 2, ⋯, be a completely multiplicative function,
that is, let f(n₁n₂) = f(n₁)f(n₂) but f(1) ≠ 0. Such a function is uniquely
determined by an arbitrary assignment of the values f(p), and is a bounded
function if and only if | f(p) | ≤ 1 holds for every prime. The following
theorem will be proved:


If f(n) is completely multiplicative and bounded, and if the function
F(s), defined for σ > 1 by

\[
F(s) = \sum_{n=1}^{\infty} f(n)/n^{s}, \tag{1}
\]

has no singular point on σ = 1, then it has at most one zero on σ = 1.


This assertion is meant to imply that the zero, if any, cannot be a
multiple zero. That it can occur at all is shown by Liouville’s example,
f(p) = −1, where s = 1 is a zero of F(s) = ζ(2s)/ζ(s). Since this F(s)
is replaced by F(s − iα) if every f(p) = −1 is multiplied by p^{iα}, the zero
can occur at any point, s = 1 + iα, of the line σ = 1.
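Liouville's example can be checked numerically inside the half-plane of absolute convergence. The sketch below is illustrative only. It evaluates a partial sum of the series with f(p) = −1 at s = 2 and compares it with ζ(4)/ζ(2) = π²/15, using the classical values ζ(2) = π²/6 and ζ(4) = π⁴/90. The function names and the cutoff of 20000 terms are arbitrary choices.

```python
import math


def liouville(n):
    """Completely multiplicative function taking the value -1 at every prime."""
    sign, d = 1, 2
    while d * d <= n:
        while n % d == 0:
            n //= d
            sign = -sign
        d += 1
    if n > 1:  # one prime factor larger than the square root remains
        sign = -sign
    return sign


def partial_sum(s, terms):
    """Sum of f(n)/n**s for n = 1, ..., terms, with f the function above."""
    return sum(liouville(n) / n**s for n in range(1, terms + 1))


approx = partial_sum(2.0, 20000)
exact = math.pi**2 / 15  # zeta(4)/zeta(2)
print(approx, exact)
```

The neglected tail is at most the sum of 1/n² over n > 20000, which is below 1/20000, so the two printed values should agree to about four decimal places.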

For reasons of symmetry, the zero, if any, must be at s = 1 if f(n) is
real-valued. In this case, the assumption of boundedness, which is then
equivalent to −1 ≤ f(p) ≤ 1, can be refined to −1 ≤ f(p), if (1) is
absolutely convergent for σ > 1. This was proved in [2] by an argument
based on ζ(s)F(s). The above theorem will be proved by combining that
argument with a device, introduced in this context by Ingham [1], which
replaces ζ(s)F(s) by

\[
G(s) = \zeta^{2}(s) F(s) F^{*}(s), \tag{2}
\]

where F*(s) denotes the Dirichlet series the coefficients of which are the
complex conjugates of the coefficients of (1).

* Received June 15, 1951.

First, if σ > 1, then, since | f(n) | ≤ 1, logarithmic differentiation of the
Euler factorization of (1) gives

\[
-F'/F(s) = \sum_{n=1}^{\infty} \Lambda(n) f(n)/n^{s}, \quad \text{where} \quad
-\zeta'/\zeta(s) = \sum_{n=1}^{\infty} \Lambda(n)/n^{s}. \tag{3}
\]

Hence, from (2),

\[
-G'/G(s) = \sum_{n=1}^{\infty} \Lambda(n)\{2 + f(n) + \bar{f}(n)\}/n^{s}. \tag{4}
\]

Since F(s) is supposed to remain regular on σ = 1, it is clear from (2)
that the limit

\[
m = m(t) = \lim_{\epsilon \to +0} \epsilon\, G'/G(1 + \epsilon + it) \tag{5}
\]


exists for every real t and is the order of the zero s = 1 + it of G(s), with
the understanding that this order can be negative or 0. On the other hand,
since | f(n) | ≤ 1 and Λ(n) ≥ 0, every coefficient of (4) is non-negative.
Hence it is clear from (5) that | m(t) | ≤ | m(0) | holds for every t.
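The last step can be written out as follows. The symbol a_n is introduced here only for illustration; it denotes the coefficient of n^{-s} in the Dirichlet series of −G′/G obtained by combining (2) with (3).

```latex
\[
a_n = \Lambda(n)\bigl(2 + f(n) + \overline{f(n)}\bigr)
    = 2\Lambda(n)\bigl(1 + \operatorname{Re} f(n)\bigr) \ge 0,
\qquad
-\frac{G'}{G}(s) = \sum_{n=1}^{\infty} \frac{a_n}{n^{s}} \quad (\sigma > 1),
\]
\[
\Bigl| \epsilon\,\frac{G'}{G}(1 + \epsilon + it) \Bigr|
  = \epsilon\,\Bigl| \sum_{n=1}^{\infty} \frac{a_n}{n^{1 + \epsilon + it}} \Bigr|
  \le \epsilon \sum_{n=1}^{\infty} \frac{a_n}{n^{1 + \epsilon}}
  = \Bigl| \epsilon\,\frac{G'}{G}(1 + \epsilon) \Bigr| \qquad (\epsilon > 0).
\]
```

Letting ε tend to +0 on both sides and using (5) gives | m(t) | ≤ | m(0) |.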

In particular m(t) must vanish identically if it vanishes at 0. This
means that G(s) must be regular and non-vanishing at every s = 1 + it if it
is regular and non-vanishing at s = 1. But the latter assumption is satisfied
if F(s) vanishes at s = 1 in the first order. This is clear from (2), since
ζ(s) has a pole of first order at s = 1. Consequently, if F(s) has a simple
zero at s = 1, then G(s) has no zero s = 1 + it ≠ 1, which, in view of (2),
means that s = 1 is the only zero of F(s) on the line σ = 1. It follows that,
in order to prove that

\[
F(1 + it) \ne 0 \ \text{for every } t \ne 0 \ \text{if } F(1) = 0, \tag{6}
\]

it is sufficient to show that F(s) cannot have a multiple zero at s = 1.
Since | f(n)Λ(n) | ≤ Λ(n), it is clear from (3) that

\[
| F'/F(1 + \epsilon) | \le -\zeta'/\zeta(1 + \epsilon)
\]

if ε > 0. It follows therefore from

\[
\lim \epsilon\, \zeta'/\zeta(1 + \epsilon) = -1 \quad \text{that} \quad
\lim | \epsilon\, F'/F(1 + \epsilon) | \le 1.
\]


Finally, the last inequality implies that F(s) cannot have a multiple zero at
s = 1, i.e., that

\[
F'(1) \ne 0 \ \text{if } F(1) = 0. \tag{7}
\]

This proves (6). But (6) implies that

\[
F(1 + it) \ne 0 \ \text{for every } t \ne \alpha \ \text{if } F(1 + i\alpha) = 0, \tag{8}
\]




where α is any real number. In fact, (8) follows if f(n) in (1) is replaced
by f(n)n^{−iα} and then (6) is applied to the new function (1). Similarly, (7)
implies that

\[
F'(1 + i\alpha) \ne 0 \ \text{if } F(1 + i\alpha) = 0. \tag{9}
\]

Clearly, (8) and (9) together are equivalent to the theorem italicized above.

It is clear from the proof that, instead of assuming the regularity of
F(s) on σ = 1, it is sufficient to assume that F(s) and F′(s), where σ > 1,
go over into continuous boundary values as σ → 1. In fact, a somewhat less
stringent condition would also suffice.


THE JOHNS HOPKINS UNIVERSITY. 


REFERENCES. 


[1] A. E. Ingham, “Note on Riemann’s zeta-function and Dirichlet’s L-functions,”
Journal of the London Mathematical Society, vol. 5 (1930), pp. 107-112.

[2] A. Wintner, “On the non-vanishing of certain Dirichlet series,” Rendiconti del
Circolo Matematico di Palermo, New series, vol. 1 (1952).




ON THE FUNDAMENTAL GROUP OF AN ALGEBRAIC VARIETY.* 


By WEI-LIANG CHOW.


It is well known that any 1-cycle in an algebraic surface can be deformed 
into a 1-cycle lying in a generic plane section of the surface.¹ The usual
proof of this theorem, which can be easily generalized from a surface to any 
non-singular algebraic variety, is topological and consists of a simple con- 
struction of the deformation chain. In the transcendental theory there is a 
generalization of this theorem, at least in its homology aspect, which can be 
stated as follows:² There exist exactly 2p independent 1-cycles in an algebraic
surface which are not homologous to 1-cycles belonging to a generic curve of 
an irrational pencil of genus p. In this paper we shall show that this theorem 
is a special case of a more general theorem about the fundamental group of 
an algebraic variety under a rational transformation. Our method of proof 
will be purely topological; the essential idea is that although a rational
transformation is not in general a fibre mapping, the covering homotopy theorem
is nevertheless true, in a somewhat modified form, for the mapping of a 
1-simplex. 

In section 1 the notion of a fibre system is introduced, and certain sub- 
systems of an algebraic system are shown to be (or can be considered as) fibre 
systems. The notion of a fibre system is a generalization of that of a fibre 
space, and just as in the case of a fibre space we have also here as a funda- 
mental property the validity of the covering homotopy theorem, which must 
now be formulated in a somewhat modified form. This notion of a fibre 
system is a very useful tool in the study of the topology of algebraic varieties; 
in this paper we shall limit ourselves strictly to the particular problem in 
question, but we hope to show in a later paper that the method is applicable 
also to other similar problems in algebraic geometry. In section 2 the results 
of section 1 will be used to prove two theorems; one of them (Theorem 2) is 
the theorem mentioned above, the other (Theorem 1) is a theorem concerning 
the deformation of 1-cycles into a member of an algebraic system with at least 


* Received August 20, 1951.
¹ See, e.g., O. Zariski, Algebraic Surfaces, p. 108.
² See O. Zariski, Algebraic Surfaces, p. 144.


one base point, which can also be regarded as a (partial) generalization of a
result of Severi³ (proved by transcendental methods).


1. Let U and V be topological spaces, and let G(y) be a function which
assigns to each point y in V a subset G(y) in U. We shall say that the system
of subsets G(y) defines a fibre system in U, if there exists an open covering
𝔑 = {N} of V such that for each set N there exists a continuous function
φ_N(x, y), defined for all points x ⊂ G(N), y ⊂ N in the product space U × V
with values in U, with the following properties:

\[
\phi_N(x, y) \subset G(y), \qquad (x \subset G(N),\ y \subset N),
\]
\[
\phi_N(x, y) = x, \qquad (x \subset G(y),\ y \subset N).
\]


The space V is called the base space of the fibre system, and the open sets N
and the corresponding functions φ_N(x, y) are called the slicing neighborhoods
and the slicing functions respectively. Let f(z) and g(z) be continuous
mappings of a topological space Z into U and V respectively, such that for
each point z ⊂ Z we have f(z) ⊂ G(g(z)), and let g(z, t), 0 ≤ t ≤ 1, be a
homotopy of the mapping g(z) in V. Then a homotopy f(z, t), 0 ≤ t ≤ 1,
of f(z) in U is said to cover the homotopy g(z, t), or to be a covering
homotopy of g(z, t), if we have f(z, t) ⊂ G(g(z, t)) for all z, t. The covering
homotopy theorem (in the weak form) asserts that if V is a normal Hausdorff
space and if Z is compact, then there exists always a covering homotopy;
furthermore, if g(z, t) leaves a point z ⊂ Z fixed, we can assume that f(z, t)
also leaves z fixed. That this covering homotopy theorem is true for any fibre
system in U can be seen as follows. We observe first that in case the G(y) is
the inverse function π^{-1}(y) of a continuous mapping π(x) of U onto V, then
we have a (generalized) fibre space as defined by S. T. Hu⁴; and, as has
been observed by Hu, the covering homotopy theorem is true for such a fibre
space. The general case can be reduced to this special case by considering
the graph W of the function G(y), i.e. the set of all points x × y in
U × V satisfying the condition x ⊂ G(y). Let τ(w) and π(w) be the


³ F. Severi, “Intorno al teorema d’Abel sulle superficie algebriche ed alla riduzione
a forma normale degl’integrali di Picard,” Rendiconti del Circolo Matematico di Palermo,
vol. 21 (1906), p. 261, Teorema I.

⁴ Sze-Tsen Hu, “On generalizing the notion of fibre spaces to include the fibre
bundles,” Proceedings of the American Mathematical Society, vol. 1 (1950), pp. 756-762.
It is convenient to use this generalized notion of a fibre space, though the particular
fibre systems used in the present paper can all be “derived” from fibre bundles which
are also at the same time fibre spaces in the sense of Hurewicz-Steenrod.


mappings of $W$ into $U$ and $V$ induced by the projections of $U \times V$ into $U$
and $V$ respectively. It is clear that $\pi(w)$ is a mapping of $W$ onto $V$, and
that we have $\tau(\pi^{-1}(y)) = G(y)$ for each $y \in V$. The space $W$ can then be
made into a fibre space with respect to the mapping $\pi(w)$ if we define for
each $N$ the slicing function by the formula:

\[ \bar{\varphi}_N(w, y) = \varphi_N(\tau(w), y) \times y \qquad (w \in \pi^{-1}(N),\ y \in N). \]

If we set $h(z) = f(z) \times g(z)$, then $h(z)$ is a continuous mapping of $Z$ into
$W$ such that $h(z) \in \pi^{-1}(g(z))$. Since the covering homotopy theorem holds
for the fibre mapping $\pi(w)$, there exists a homotopy $h(z,t)$ of $h(z)$ in $W$ such
that $h(z,t) \in \pi^{-1}(g(z,t))$. Then we have $\tau h(z,t) \in \tau\pi^{-1}(g(z,t)) = G(g(z,t))$,
so that $\tau h(z,t)$ is a homotopy of $f(z)$ in $U$ which covers the homotopy $g(z,t)$
in $V$.
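
The reduction to the case treated by Hu can be summarized in symbols. The following is only a sketch restating the argument above; the letters $\tau$ and $\pi$ for the two projections of the graph are assumed names.

```latex
% Graph of the fibre system and its two projections (names assumed).
\[
  W = \{\, x \times y \in U \times V : x \in G(y) \,\}, \qquad
  \tau(x \times y) = x, \qquad \pi(x \times y) = y .
\]
% The fibres of \pi project onto the members of the system.
\[
  \pi^{-1}(y) = G(y) \times y, \qquad \tau(\pi^{-1}(y)) = G(y).
\]
% Lift the homotopy in W, then project back to U.
\[
  h(z) = f(z) \times g(z), \qquad
  h(z,t) \in \pi^{-1}(g(z,t))
  \;\Longrightarrow\;
  \tau h(z,t) \in \tau\pi^{-1}(g(z,t)) = G(g(z,t)).
\]
```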

[Note added in proof (June 6, 1952). Professor Beno Eckmann has
recently called my attention to the fact that essentially the same concept as
that of a fibre system has been introduced by him under the name
"retrahierbare Ueberdeckung" in his paper "Zur Homotopietheorie gefaserter Raeume,"
Commentarii Mathematici Helvetici, vol. 14 (1941), pp. 141-192. In the
first part of that paper the covering homotopy theorem was proved for a
"retrahierbare Ueberdeckung", and an application was made of this theorem
to the covering of a sphere by its great spheres.]


Let $U$ be a non-singular algebraic variety of dimension $r$, and let $\Phi$ be
an irreducible algebraic correspondence of dimension $t$ between $U$ and an
algebraic variety $V$ of dimension $s$. Then for a generic point $\eta$ in $V$ the
set $\Phi^{-1}(\eta)$ is an irreducible algebraic variety of dimension $d = t - s$ in $U$,
and we can consider $\Phi^{-1}(\eta)$ as a generic element of an algebraic system of
$d$-cycles in $U$. For any point $y$ in $V$ the variety $\Phi^{-1}(y)$ is the carrier of
the set of all $d$-cycles which are specializations of the $d$-cycle $\Phi^{-1}(\eta)$ over the
specialization $\eta \to y$. A point $y$ in $V$ is said to be semi-regular with respect
to the correspondence $\Phi$ (or rather the inverse correspondence $\Phi^{-1}$), if there
is a uniquely determined specialization cycle of $\Phi^{-1}(\eta)$ over the specialization
$\eta \to y$, and if this specialization cycle has no multiple components. It is easily
seen that a point $y$ in $V$ is semi-regular with respect to $\Phi$ if and only if the
variety $\Phi^{-1}(y)$ has the dimension $d$ and the same degree (in the ambient
projective space) as the variety $\Phi^{-1}(\eta)$, so that we can consider $\Phi^{-1}(y)$ itself
as the specialization cycle of $\Phi^{-1}(\eta)$ over the specialization $\eta \to y$.

Since $U$ is a differentiable manifold (of class $C^\infty$), we can introduce in
$U$ a Riemannian metric. In fact, let $M = \{M_i\}$ be a locally finite system of




THE FUNDAMENTAL GROUP OF AN ALGEBRAIC VARIETY. 729 


coordinate neighborhoods covering $U$, and let the set of differentiable functions
$\{e_i(x)\}$ be a partition of unity subordinate to this covering; then, if we
denote by $ds_i^2$ the Euclidean metric of the coordinate neighborhood $M_i$, the
differential form $ds^2 = \sum_i e_i(x)\, ds_i^2$ defines a Riemannian metric on $U$. We
observe that by means of a suitable choice of the covering neighborhoods $M_i$
and the partition functions $e_i(x)$ we can make the Riemannian metric $ds^2$
in a sufficiently small neighborhood of any given point equal to the Euclidean
metric with respect to any given coordinate system around this point. This
fact will be convenient for us later. Let $y'$ be a semi-regular point in $V$, and
let $R$ be a compact subset in $\Phi^{-1}(y')$ consisting of only simple points in
$\Phi^{-1}(y')$. If $p(x)$ is any differentiable function which assigns to each point $x$
in $R$ a $(2r - 2d)$-dimensional direction element which is transversal to the
tangent space of $\Phi^{-1}(y')$ at the point $x$, then there exists a differentiable
system of $(2r - 2d)$-dimensional geodesic surfaces (or geodesic $(2r - 2d)$-surfaces)
$P(x)$ ($x \in R$), such that for each point $x$ in $R$ the surface $P(x)$
has the tangential direction $p(x)$ at $x$. If $N$ is a sufficiently small neighborhood
of $y'$ in $V$, then for every point $y$ in $N$ and every point $x$ in $R$, the
intersection $\Phi^{-1}(y) \cap P(x)$ consists of exactly one point which is simple in $\Phi^{-1}(y)$,
and the mapping $x \to \Phi^{-1}(y) \cap P(x)$ is a homeomorphism of $R$ onto a compact
subset $R(y)$ in $\Phi^{-1}(y)$. Thus each point $x$ in $\sum_{y \in N} R(y)$ is contained in
exactly one geodesic $(2r - 2d)$-surface of the system, which we shall also
denote by $P(x)$; the function $\varphi_N(x, y) = P(x) \cap R(y)$ ($x \in \sum_{y \in N} R(y)$, $y \in N$)
is then a slicing function for the system $R(y)$ ($y \in N$).

In case the variety $\Phi^{-1}(y')$ is non-singular, we can set $R = \Phi^{-1}(y')$ and
hence $R(y) = \Phi^{-1}(y)$ for all $y$ in $N$, so that the system of varieties $\Phi^{-1}(y)$
($y \in N$) defines a fibre system in $U$. Now, if the generic variety $\Phi^{-1}(\eta)$ is
non-singular, then there exists a proper subvariety $H$ in $V$ such that every
point $y$ in $V - H$ is semi-regular with respect to $\Phi$ and the variety $\Phi^{-1}(y)$ is
non-singular. It follows then that the system of varieties $\Phi^{-1}(y)$ ($y \in V - H$)
defines a fibre system in $U$. We shall now show that in case $d = 1$ this
assertion is also true (with a suitable definition of $H$) even if $\Phi^{-1}(\eta)$ has
singular points. We shall say that a point $y'$ in $V$ is regular with respect
to $\Phi$, if $y'$ is semi-regular and if the variety $\Phi^{-1}(y')$ has the same singularities
as the generic variety $\Phi^{-1}(\eta)$, i.e. each singular branch of the curve $\Phi^{-1}(y')$
is the specialization of a singular branch of the same order of the curve $\Phi^{-1}(\eta)$.
It is easily seen that the set of all points in $V$ which are not regular with
respect to $\Phi$ is a proper subvariety in $V$, which we shall also denote by $H$.
Let $y'$ be a point in $V - H$, and let $x'$ be a singular point in $\Phi^{-1}(y')$; then




there is a coordinate neighborhood $M$ of $x'$ in $U$ and a suitably chosen system
of coordinates $u_1, \dots, u_r$ in $M$ with origin at $x'$, such that for every point $y$
in a sufficiently small neighborhood $N$ of $y'$ in $V$, the curve $\Phi^{-1}(y) \cap M$ is a
regular analytic covering space of a fixed degree $g$ over a neighborhood $M_1$
of the origin in the complex $u_1$-plane with a unique branch point over the
origin. Let $M'_1$ be a circular region $|u_1| < \rho$ ($\rho > 0$) such that its
closure $\bar{M}'_1$ is contained in $M_1$, and let $P(\alpha)$ ($\alpha \in M_1$) be the "hyperplane"
in $M$ defined by the equation $u_1 = \alpha$. Then, for each $u_1 \ne 0$ in $\bar{M}'_1$, there
exist $g$ disjoint circular domains $P_i(u_1)$, $i = 1, \dots, g$, in $P(u_1)$, such that
for each $y$ in $N$ and for each $i = 1, \dots, g$, the intersection $\Phi^{-1}(y) \cap P_i(u_1)$
consists of exactly one point, while for $u_1 = 0$, the intersection $\Phi^{-1}(y) \cap P(0)$
itself consists of exactly one point, namely the branch point of $\Phi^{-1}(y)$
over $u_1 = 0$. For each $y$ in $N$, we set

\[ L(y) = \sum_{u_1 \in M'_1} \Phi^{-1}(y) \cap P(u_1) \quad\text{and}\quad \bar{L}(y) = \sum_{u_1 \in \bar{M}'_1} \Phi^{-1}(y) \cap P(u_1), \]

so that $\bar{L}(y)$ is the closure of the domain $L(y)$ in $\Phi^{-1}(y)$. We can then define
a slicing function for the system $\bar{L}(y)$ ($y \in N$) by setting
$\varphi_N(x, y) = \Phi^{-1}(y) \cap P_i(u_1)$ ($x \in \sum_{y \in N} \bar{L}(y)$, $y \in N$), where
$P_i(u_1)$ is the one of the $g$ domains in $P(u_1)$ which contains the point $x$.
Furthermore, if we choose our Riemannian metric in $U$ in such a way that
it coincides with the Euclidean metric in the coordinate neighborhood $M$,
then each $P_i(u_1)$ is a geodesic $(2r - 2)$-surface, and for each point $x \ne x'$
in $\bar{L}(y')$ the $P_i(u_1)$ passing through it has a tangential direction element $p(x)$
which is transversal to the tangential space of $\Phi^{-1}(y')$ at $x$. In particular
this function $p(x)$ is defined on the boundary curve $\bar{L}(y') - L(y')$ of the
domain $L(y')$, and it is also differentiable. Let now $x^{(i)}$, $i = 1, \dots, a$, be
the singular points of $\Phi^{-1}(y')$, and let $M^{(i)}$, $i = 1, \dots, a$, be suitably chosen
(disjoint) coordinate neighborhoods of the points $x^{(i)}$, $i = 1, \dots, a$, respectively
in $U$, and let the Riemannian metric in $U$ be so chosen that it coincides
with the Euclidean metric in each $M^{(i)}$. If we choose the neighborhood $N$
of $y'$ sufficiently small, then we can define a fibre system $\bar{L}^{(i)}(y)$ ($y \in N$)
and a slicing function $\varphi^{(i)}_N(x, y)$ ($x \in \sum_{y \in N} \bar{L}^{(i)}(y)$, $y \in N$) in each $M^{(i)}$, and
we have also the function $p^{(i)}(x)$ defined in $\bar{L}^{(i)}(y') - L^{(i)}(y')$. If we set
$Q(y) = \sum_{i=1}^{a} L^{(i)}(y)$ and $\bar{Q}(y) = \sum_{i=1}^{a} \bar{L}^{(i)}(y)$, then we can consider all the functions
$\varphi^{(i)}_N(x, y)$ together as a single slicing function $\varphi_N(x, y)$ ($x \in \sum_{y \in N} \bar{Q}(y)$, $y \in N$)
for the fibre system $\bar{Q}(y)$ ($y \in N$), and also all the functions $p^{(i)}(x)$ together as
a single function $p(x)$ defined on the boundary $\bar{Q}(y') - Q(y')$ of the domain
$Q(y')$. For each $y \in N$, let $R(y) = \Phi^{-1}(y) - Q(y)$; it is clear that $\bar{Q}(y)$



and $R(y)$ are two complementary closed domains in $\Phi^{-1}(y)$ with the curve
$\bar{Q}(y) - Q(y)$ as their common boundary. In particular the boundary of
$R(y')$ is $\bar{Q}(y') - Q(y')$, and the function $p(x)$ is defined and differentiable
on the boundary $\bar{Q}(y') - Q(y')$. If we assign to each point $x$ in $R(y')$ the
set of all $(2r - 2)$-dimensional direction elements at $x$ which are transversal
to the tangent space of $\Phi^{-1}(y')$ at $x$, then we obtain a differentiable fibre
bundle over the space $R(y')$, in which the fibre is topologically a $(4r - 4)$-cell.
It follows then that the differentiable function $p(x)$, which is defined on the
boundary of $R(y')$, can be extended to a differentiable function $p'(x)$ in the
entire space $R(y')$. If we denote by $P'(x)$ ($x \in R(y')$) the system of
geodesic $(2r - 2)$-surfaces corresponding to the function $p'(x)$, then for each $y$
in $N$, provided $N$ is taken sufficiently small, the mapping $x \to \Phi^{-1}(y) \cap P'(x)$
is a homeomorphism of $R(y')$ onto a closed domain in $\Phi^{-1}(y)$ whose boundary
is $\bar{Q}(y) - Q(y)$ and which approaches $R(y')$ as $y$ approaches $y'$; hence this
domain must be $R(y)$. If we denote by $P'(x)$, for any point $x$ in $\sum_{y \in N} R(y)$,
the one geodesic $(2r - 2)$-surface of this system which contains the point $x$,
then the function

\[ \varphi'_N(x, y) = \Phi^{-1}(y) \cap P'(x) \qquad (x \in \sum_{y \in N} R(y),\ y \in N) \]

is a slicing function of the system $R(y)$ ($y \in N$). Since, for
$x \in \sum_{y \in N} \bar{Q}(y) \cap \sum_{y \in N} R(y)$, we have

\[ \varphi_N(x, y) = \Phi^{-1}(y) \cap P_i(u_1) = \Phi^{-1}(y) \cap P'(x) = \varphi'_N(x, y), \]

the two slicing functions $\varphi_N(x, y)$ and $\varphi'_N(x, y)$ are concordant whenever both
are defined and hence can be considered together as one slicing function for
the system of curves $\Phi^{-1}(y)$ ($y \in N$). Thus we have shown that the system
of curves $\Phi^{-1}(y)$ ($y \in V - H$) is a fibre system.


2. THEOREM 1. Let $U$ be a non-singular algebraic variety of dimension
$r$, and let a subvariety $G$ of dimension $d$ be a member of an irreducible algebraic
system $G(y)$ ($y \in V$), with $U$ as its carrier variety, which has at least one
base point, and whose generic member is irreducible. If $f(z)$ is a continuous
mapping of the unit interval $I$ into $U$, with $f(0) = f(1) = x^{(0)}$ in $G$, then
a finite power of this mapping is homotopic rel. $z = 0, 1$, to a continuous
mapping of $I$ into $G$. If the system $G(y)$ is involutional, then $f(z)$ itself is
homotopic rel. $z = 0, 1$, to a continuous mapping of $I$ into $G$.




Remark. The $m$-th power of $f(z)$ is the mapping $f^m(z)$ of $I$ defined by
setting $f^m(z) = f(mz - i)$ for $i/m \le z \le (i+1)/m$, $i = 0, 1, \dots, m - 1$.
The system $G(y)$ is said to be involutional, if it is induced by a rational
transformation of $U$ onto $V$.
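
As an illustration of the definition in the Remark, the case $m = 2$ can be written out. This is only a sketch; the superscript notation $f^{2}$ for the second power is an assumed way of writing it.

```latex
\[
  f^{2}(z) =
  \begin{cases}
    f(2z)     & (0 \le z \le \tfrac12), \\[2pt]
    f(2z - 1) & (\tfrac12 \le z \le 1).
  \end{cases}
\]
% The two pieces agree at z = 1/2 because f(1) = f(0),
% so f^{2} is again a continuous closed path with the same end points.
```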


Proof. Let the algebraic system be defined by an irreducible correspondence
$\Phi$ between $U$ and the variety $V$, and let $y^{(0)}$ be the point in $V$ such that
$\Phi^{-1}(y^{(0)}) = G(y^{(0)}) = G$. It is sufficient to prove our theorem for the case
where $y^{(0)}$ is any point in an everywhere dense subset in $V$; this follows from
the fact that for any point $y$ in $V$ the variety $\Phi^{-1}(y)$ is a neighborhood
deformation retract in $U$. We begin with the special case where the system
$G(y)$ is the linear system cut out on $U$ by the system of linear subspaces of
dimension $n - r + d$ ($d \ge 1$) in the ambient space $S_n$, which all pass through
a sufficiently general $S_{n-r+d-1}$. Since in this case $\Phi^{-1}(\eta)$ is non-singular for a
generic point $\eta$ in $V$, there exists a subvariety $H$ in $V$ such that each point $y$
in $V - H$ is semi-regular and $\Phi^{-1}(y)$ is non-singular. We can assume without
any loss of generality that $y^{(0)}$ is a point in $V - H$; furthermore, we can also
assume that $x^{(0)}$ is a point in $G$ outside of $\Phi^{-1}(H)$. Since $\Phi^{-1}(H)$ is a proper
subvariety in $U$ and hence topologically a subcomplex of dimension $\le 2r - 2$
in the $2r$-dimensional topological manifold $U$, we can assume that, after a
suitable homotopy (rel. $z = 0, 1$) if necessary, $f(z)$ is a mapping of $I$ into
$U - \Phi^{-1}(H)$. Then the mapping $g(z) = \Phi(f(z))$ of $I$ into $V - H$ is well
defined, and we have evidently $g(0) = g(1) = y^{(0)}$ and $f(z) \in G(g(z))$ for
all $z$. Let $x^{(1)}$ be any point in $G \cap S_{n-r+d-1}$, and let $h(z)$ be any continuous
mapping of $I$ into $G$ such that $h(0) = x^{(0)}$ and $h(1) = x^{(1)}$. We set

\[
f'(z) =
\begin{cases}
f(4z) & (0 \le z \le \tfrac14), \\
h(4z - 1) & (\tfrac14 \le z \le \tfrac12), \\
x^{(1)} & (\tfrac12 \le z \le \tfrac34), \\
h(4 - 4z) & (\tfrac34 \le z \le 1),
\end{cases}
\]

and

\[
g'(z) =
\begin{cases}
g(4z) & (0 \le z \le \tfrac14), \\
y^{(0)} & (\tfrac14 \le z \le \tfrac12), \\
g(3 - 4z) & (\tfrac12 \le z \le \tfrac34), \\
y^{(0)} & (\tfrac34 \le z \le 1);
\end{cases}
\]

it is clear that $f'(z) \sim f(z)$ rel. $z = 0, 1$, and $g'(z) \sim 0$ rel. $z = 0, 1$. Since
$x^{(1)}$ is a base point of the system, we have $f'(z) \in G(g'(z))$ for all $z$, and it
follows from the covering homotopy theorem
that there is a homotopy of $f'(z)$ rel. $z = 0, 1$, which deforms $f'(z)$ into a
mapping of $I$ into $G$; hence $f(z)$ is also homotopic rel. $z = 0, 1$, to a mapping
of $I$ into $G$.
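
A piece-by-piece check of the covering condition may be helpful here. The sketch below assumes one explicit four-piece parametrization of $f'$ and $g'$ (quarter intervals, with $f'$ resting at the base point $x^{(1)}$ while $g'$ runs back along $g$); the parametrization is an assumption made for illustration, and the check uses only that $x^{(1)}$ lies in every member $G(y)$.

```latex
% Assumed parametrization:
%   f' = f . h . (constant x^{(1)}) . h^{-1},
%   g' = g . (constant y^{(0)}) . g^{-1} . (constant y^{(0)}).
\[
\begin{array}{llll}
 \text{interval} & f'(z) & g'(z) & \text{why } f'(z) \in G(g'(z)) \\ \hline
 0 \le z \le \tfrac14            & f(4z)     & g(4z)     & f(w) \in G(g(w)) \text{ for all } w \\
 \tfrac14 \le z \le \tfrac12     & h(4z-1)   & y^{(0)}   & h \text{ maps } I \text{ into } G = G(y^{(0)}) \\
 \tfrac12 \le z \le \tfrac34     & x^{(1)}   & g(3-4z)   & x^{(1)} \text{ is a base point} \\
 \tfrac34 \le z \le 1            & h(4-4z)   & y^{(0)}   & h \text{ maps } I \text{ into } G = G(y^{(0)})
\end{array}
\]
% The two paths are homotopic to f and to a constant respectively:
\[
  f' \sim f \cdot h \cdot h^{-1} \sim f, \qquad
  g' \sim g \cdot g^{-1} \sim 0 \qquad (\text{rel. } z = 0, 1).
\]
```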




Turning to the general case, we observe first that we can assume without
any loss of generality the following: (1) For a generic point $\xi$ in $U$ the
variety $\Phi(\xi)$ consists of a finite number $m$ of points; for otherwise we can
replace $V$ by its intersection with a suitably chosen linear subspace (passing
through $y^{(0)}$) in its ambient space. It is clear that we have $m = 1$ in case
$G(y)$ is an involutional system. (2) For a generic point $\eta$ in $V$ the variety
$\Phi^{-1}(\eta)$ is a curve, i.e. $d = 1$; for otherwise we can replace $U$ by its intersection
with a suitably chosen $S_{n-d+1}$ (passing through a base point of the system $G(y)$)
in $S_n$, and we have just shown above that any continuous mapping $f(z)$ of $I$
into $U$, with $f(0) = f(1)$ in $U \cap S_{n-d+1}$, is homotopic rel. $z = 0, 1$, to a
continuous mapping of $I$ into $U \cap S_{n-d+1}$. Now, let $H$ be the subvariety in $V$
containing all points which are not regular with respect to $\Phi$, so that the
system of curves $G(y)$ ($y \in V - H$) is a fibre system; let $T$ be the subvariety
in $U$ such that for every point $x$ in $U - T$ the set $\Phi(x)$ consists of
$m$ distinct points outside of $H$. Without any loss of generality we can assume
that $y^{(0)}$ is a point in $V - H$ such that $G = \Phi^{-1}(y^{(0)})$ is not entirely in $T$,
and that $x^{(0)}$ is a point in $G - T$; then $\Phi(x^{(0)})$ consists of $m$ points, one of
which is the point $y^{(0)}$. Since $T + \Phi^{-1}(H)$ is a proper subvariety in $U$ and
hence topologically a subcomplex of dimension $\le 2r - 2$ in the $2r$-dimensional
manifold $U$, we can assume that, after a suitable homotopy (rel. $z = 0, 1$) if
necessary, $f(z)$ is a mapping of $I$ into $U - T - \Phi^{-1}(H)$. Then the image
$\Phi(f(z))$ consists of $m$ distinct mappings of $I$ into $V - H$, one (and only one)
of which will be a mapping $g(z)$ such that $g(0) = y^{(0)}$. The point $g(1)$ is
one of the $m$ points in the set $\Phi(x^{(0)})$, though not necessarily the point $y^{(0)}$.
It is easily seen that if $f^m(z)$ is the $m$-th power of $f(z)$, then the image
$\Phi(f^m(z))$ of $f^m(z)$ will consist of $m$ distinct mappings of $I$ into $V - H$, one
of which will be a mapping $g^m(z)$ such that $g^m(0) = g^m(1) = y^{(0)}$. Let $x^{(1)}$
be a base point of the system $G(y)$, and let $h(z)$ be a continuous mapping
of $I$ into $G$ such that $h(0) = x^{(0)}$ and $h(1) = x^{(1)}$. If we now define $f'(z)$
and $g'(z)$ again as before, replacing $f(z)$ and $g(z)$ by $f^m(z)$ and $g^m(z)$
respectively, we can repeat exactly the same argument and conclude that $f^m(z)$ is
homotopic rel. $z = 0, 1$, to a mapping of $I$ into $G$. This concludes the proof
of Theorem 1.

In the following we shall denote by $F(U)$ the fundamental group of a
topological space $U$, considered as a group of mapping classes with some one
fixed reference point. If $W$ and $X$ are two subsets in $U$, then the identity
mapping of $W \cap X$ into $W$ will induce a homomorphism of $F(W \cap X)$ into
$F(W)$, the reference point in both groups being one and the same point in
$W \cap X$; we shall then denote by $F(W, X)$ the subgroup of $F(W)$ which is
the image of $F(W \cap X)$ under this homomorphism.
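
The definition of $F(W, X)$ can be written in one line. In the sketch below, $i_*$ is an assumed name for the homomorphism induced by the inclusion $i$ of $W \cap X$ into $W$.

```latex
\[
  i_* : F(W \cap X) \longrightarrow F(W), \qquad
  F(W, X) = i_*\bigl(F(W \cap X)\bigr) \subseteq F(W).
\]
% Two extreme cases, as a sanity check:
%   if X contains W, then W \cap X = W and F(W, X) = F(W);
%   if W \cap X is the reference point alone, then F(W, X) is the trivial subgroup.
```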




THEOREM 2. Let $\Phi$ be a rational transformation of a non-singular
algebraic variety $U$ of dimension $r$ onto a non-singular algebraic variety $V$
of dimension $s$, with the properties: (1) For a generic point $\eta$ in $V$ the
variety $\Phi^{-1}(\eta)$ is irreducible, and (2) the set $H$ of all points which are not
semi-regular with respect to $\Phi$ is a subvariety of dimension $\le s - 2$ in $V$.
Then there is a homomorphism of $F(U)$ onto $F(V)$, and the kernel of this
homomorphism is the subgroup $F(U, \Phi^{-1}(y^{(0)}))$, where $y^{(0)}$ is any sufficiently
general point in $V$.


Proof. Let $T$ be the fundamental variety of $\Phi$ in $U$; it is well known
that $T$ has a dimension $\le r - 2$. It is clear that the variety $\Phi^{-1}(\eta)$ has the
dimension $d = r - s$; let $c$ be the degree of $\Phi^{-1}(\eta)$ considered as a variety
in the ambient space $S_n$ of $U$. Let $y^{(0)}$ be a point in $V$ such that $\Phi^{-1}(y^{(0)})$
is not contained in $T$, and let $x^{(0)}$ be a point in $\Phi^{-1}(y^{(0)}) - T$. We shall
take $x^{(0)}$ as the reference point of the groups $F(U)$, $F(U - T)$, $F(\Phi^{-1}(y^{(0)}))$,
$F(U, \Phi^{-1}(y^{(0)}))$, and $F(U - T, \Phi^{-1}(y^{(0)}))$, and take $y^{(0)}$ as the reference
point of the group $F(V)$. The identity mapping of $U - T$ into $U$ induces
a homomorphism $\theta$ of $F(U - T)$ into $F(U)$; since $T$ is topologically a
subcomplex of dimension $\le 2r - 4$ in the $2r$-dimensional manifold $U$, it is
easily seen that $\theta$ is an isomorphism of $F(U - T)$ onto $F(U)$, and that the
image of $F(U - T, \Phi^{-1}(y^{(0)}))$ under $\theta$ is precisely $F(U, \Phi^{-1}(y^{(0)}))$. Since $\Phi$
is a continuous mapping of $U - T$ into $V$, there is an induced homomorphism
$\varphi$ of $F(U - T)$ into $F(V)$. We set $\lambda = \varphi\theta^{-1}$, so that $\lambda$ is a homomorphism
of $F(U)$ into $F(V)$. We shall prove our theorem by showing that (provided
the point $y^{(0)}$ is sufficiently general) $\lambda$ is a homomorphism of $F(U)$ onto
$F(V)$ and its kernel is $F(U, \Phi^{-1}(y^{(0)}))$.

Let $L$ be a linear subspace of dimension $n - r + s$ in the ambient space
$S_n$ of $U$, such that $U \cap L$ is a non-singular variety of dimension $s$. The
rational transformation $\Phi$ will then induce a rational transformation $\bar{\Phi}$ of
$U \cap L$ onto $V$, and we have $\bar{\Phi}^{-1}(y) \subset \Phi^{-1}(y) \cap L$ for every point $y$ in $V$.
Since, for a general point $\eta$ in $V$, the set $\bar{\Phi}^{-1}(\eta) = \Phi^{-1}(\eta) \cap L$ consists of $c$
distinct points in $U - T$, there exists a proper subvariety $K$ in $V$ such that
for every point $y$ in $V - K$ the set $\bar{\Phi}^{-1}(y)$ consists of $c$ distinct points in
$U - T$ (which are then necessarily simple points in the variety $\Phi^{-1}(y)$).
We shall also assume that $y^{(0)}$ is a point in $V - K$ and $x^{(0)}$ is a point in
$\bar{\Phi}^{-1}(y^{(0)})$.

Let $g(z)$ be a continuous mapping of the unit interval $I$ into $V$ such
that $g(0) = g(1) = y^{(0)}$. Since $K$ is topologically a subcomplex of dimension
$\le 2s - 2$ in the topological manifold $V$ of dimension $2s$, we can assume, after
a homotopy (rel. $z = 0, 1$) if necessary, that $g(z)$ is a mapping of $I$ into
$V - K$. Since $\bar{\Phi}^{-1}(V - K)$ is a $c$-fold regular covering space of $V - K$,
the inverse image $\bar{\Phi}^{-1}(g(z))$ consists of $c$ distinct mappings of $I$ into $U - T$,
one of which will be a mapping $f(z)$ such that $f(0) = x^{(0)}$. Let $x^{(1)} = f(1)$,
and let $f_1(z)$ be any continuous mapping of $I$ into $\Phi^{-1}(y^{(0)}) - T$ such that
$f_1(0) = x^{(1)}$ and $f_1(1) = x^{(0)}$. If we set

\[
f'(z) =
\begin{cases}
f(2z) & (0 \le z \le \tfrac12), \\
f_1(2z - 1) & (\tfrac12 \le z \le 1),
\end{cases}
\]

then we have evidently a continuous mapping of $I$ into $U - T$ such that
$\Phi(f'(z)) \sim g(z)$ rel. $z = 0, 1$. This shows that $\lambda$ is a homomorphism of $F(U)$
onto $F(V)$.

Now let $f(z)$ be a continuous mapping of $I$ into $U$ such that $f(0) = f(1)
= x^{(0)}$. According to Theorem 1, $f(z)$ is homotopic rel. $z = 0, 1$ to a mapping
of $I$ into $U \cap L$, and hence also homotopic rel. $z = 0, 1$ to a mapping of $I$
into $\bar{\Phi}^{-1}(V - K)$. We can therefore assume that $f(z)$ is already a mapping
of $I$ into $\bar{\Phi}^{-1}(V - K)$; then the mapping $g(z) = \Phi(f(z))$ is defined, and
we have $g(0) = g(1) = y^{(0)}$. In order to prove that the kernel of $\lambda$ is
$F(U, \Phi^{-1}(y^{(0)}))$, we have only to show that if the mapping $g(z)$ is homotopic
to zero rel. $z = 0, 1$ in $V$, then $f(z)$ is homotopic rel. $z = 0, 1$ to a mapping
of $I$ into $\Phi^{-1}(y^{(0)})$. Without loss of generality we can restrict ourselves to
the case $d = 1$; for otherwise we can replace $U$ by its intersection with a
suitably chosen linear subspace of dimension $n - r + s + 1$ which contains
the space $L$. It is then easily seen that we can assume the subvariety $K$ so
chosen that every point in $V - K$ is regular with respect to $\Phi$; then the
system of curves $\Phi^{-1}(y)$ ($y \in V - K$) is a fibre system. Furthermore, there
exists a subvariety $H'$ of dimension $\le s - 2$ in $V$, which contains $H$ and is
contained in $K$, such that for every point $y$ in $V - H'$ the set $\bar{\Phi}^{-1}(y)$ consists
of at most $c$ points, each of which is either a simple point in $\Phi^{-1}(y)$ or a
singular point of a "generic" nature.

Since $K$ is topologically a subcomplex of dimension $\le 2s - 2$, and $H'$
is topologically a subcomplex of dimension $\le 2s - 4$ in the $2s$-dimensional
manifold $V$, there exists in $V - H'$ a homotopy $g(z,t)$ of the mapping $g(z)$
rel. $z = 0, 1$ with the following properties: (1) $g(z,t) \in V - K$ for all $z$
and $t$ except a finite number $m$ of points $g((2i-1)/2m, 1)$, $i = 1, \dots, m$,
which all belong to $K - H'$; (2) $g(i/m, 1) = y^{(0)}$ for $i = 1, \dots, m$,
and $g(z, 1) = g((2i-1)/m - z, 1)$ for $(i-1)/m \le z \le i/m$. Since
$\bar{\Phi}^{-1}(V - H')$ is an analytic covering space (with branch points) of $V - H'$, and
since moreover $\bar{\Phi}^{-1}(V - K)$ is a regular covering space of $V - K$, there
corresponds to the homotopy $g(z,t)$ a uniquely determined homotopy $f(z,t)$ rel. $z
= 0, 1$ in $\bar{\Phi}^{-1}(V - H')$, with $f(z,0) = f(z)$ and $f(z,t) \in \bar{\Phi}^{-1}(g(z,t))$.
Therefore, to prove our assertion, it is sufficient to consider the following problem.
Let $g(z)$ be a continuous mapping of $I$ into $V - H'$ with the properties:
$g(0) = g(1) = y^{(0)}$; $g(z) = g(1 - z)$ for all $z$; $g(z) \in V - K$ for all $z \ne \tfrac12$,
and $g(\tfrac12)$ belongs to $K - H'$. Let $f(z)$ be a continuous mapping of $I$ into
$\bar{\Phi}^{-1}(V - H')$ such that $f(z) \in \bar{\Phi}^{-1}(g(z))$ for all $z$; then we have to show
that $f(z)$ is homotopic rel. $z = 0, 1$ in $U$ to a mapping of $I$ into $\Phi^{-1}(y^{(0)})$.

Let $x^{(1)} = f(\tfrac12)$ and $y^{(1)} = g(\tfrac12)$, and let $R$ be a compact neighborhood
of $x^{(1)}$ in $\Phi^{-1}(y^{(1)})$ which contains no singular points of $\Phi^{-1}(y^{(1)})$ except
possibly the point $x^{(1)}$ itself. Then, as we have shown in the preceding section,
there exists a neighborhood $N$ of $y^{(1)}$, and to every point $y$ in $N$ there
corresponds a compact subset $R(y)$ in $\Phi^{-1}(y)$ (with $R(y^{(1)}) = R$), such that
the system $R(y)$ ($y \in N$) is a fibre system; furthermore, there is a neighborhood
$M$ of $x^{(1)}$ in $U$ such that $M \cap \Phi^{-1}(y) \subset R(y)$ for all $y$ in $N$. Let $I_\epsilon$
($\epsilon > 0$) denote the interval $|z - \tfrac12| \le \epsilon$; and let $\epsilon$ be taken so small that
$f(z) \in M$ and $g(z) \in N$ for all $z$ in $I_\epsilon$. Since the mapping $g(z)$ of $I_\epsilon$ into
$N$ is evidently homotopic rel. $z = \tfrac12 - \epsilon, \tfrac12 + \epsilon$ in $N$ to a mapping of $I_\epsilon$ into
the point $y^{(2)} = g(\tfrac12 - \epsilon) = g(\tfrac12 + \epsilon)$, it follows from the covering homotopy
theorem that the mapping $f(z)$ of $I_\epsilon$ into $M \cap \Phi^{-1}(N)$ is also homotopic
rel. $z = \tfrac12 - \epsilon, \tfrac12 + \epsilon$ to a mapping $h(z)$ of $I_\epsilon$ into $\Phi^{-1}(y^{(2)})$. If we set

\[
f'(z) =
\begin{cases}
f(z) & (|z - \tfrac12| \ge \epsilon), \\
h(z) & (|z - \tfrac12| \le \epsilon),
\end{cases}
\qquad
g'(z) =
\begin{cases}
g(z) & (|z - \tfrac12| \ge \epsilon), \\
y^{(2)} & (|z - \tfrac12| \le \epsilon),
\end{cases}
\]

then we have evidently $f(z) \sim f'(z)$ rel. $z = 0, 1$ in $U$ and $g'(z) \sim 0$ rel. $z = 0, 1$
in $V - K$. Since $f'(z) \in \Phi^{-1}(g'(z))$ for all $z$, it follows then from the
covering homotopy theorem that $f'(z)$ is homotopic rel. $z = 0, 1$ to a mapping
of $I$ into $\Phi^{-1}(y^{(0)})$, and hence $f(z)$ is also homotopic rel. $z = 0, 1$ to a mapping
of $I$ into $\Phi^{-1}(y^{(0)})$. This concludes the proof of Theorem 2.


THE JOHNS HOPKINS UNIVERSITY. 


INDUCED REPRESENTATIONS.* 


By F. I. MAUTNER.


1. Introduction; statement of Theorem 1. Let $G$ be a locally compact
topological group and $K$ an arbitrary (but fixed) compact subgroup of $G$.
With every continuous unitary representation $u$ of $K$ in a Hilbert (or finite
dimensional) space $\mathfrak{h}$ over the complex numbers we can associate a continuous
unitary representation $U$ of $G$, the so-called induced representation, as follows:

Consider all those Haar-measurable functions $X(g)$ defined on $G$ with
values in $\mathfrak{h}$ for which¹

         $\int_G \| X(g) \|^2 \, dg < \infty,$

where $\| X(g) \|$ denotes, for fixed $g$, the norm of $X(g)$ as an element of the
Hilbert space $\mathfrak{h}$. We restrict ourselves to only those functions $X(g)$ which
satisfy also

(1.1)    $X(gk) = u(k^{-1}) X(g)$ for all $k \in K$.

Clearly all such functions form a linear space over the complex numbers from
which we obtain a Hilbert (or finite dimensional) space $\mathfrak{H}$ if we identify
functions $X(g)$ which differ on sets of Haar-measure zero, and define an
inner product $(X, Y)$ in $\mathfrak{H}$ by

(1.2)    $(X, Y) = \int_G (X(g), Y(g)) \, dg,$

where $(X(g), Y(g))$ denotes, for fixed $g \in G$, the inner product in the space $\mathfrak{h}$.
Now define for every $y \in G$ a linear transformation $U(y)$ by

(1.3)    $(U(y)X)(g) = X(y^{-1}g).$

* Received April 19, 1951. 

¹ Introduce in $\mathfrak{h}$ an arbitrary complete orthonormal system, and denote by $X_j(g)$
the expansion coefficients of $X(g)$ with respect to it (for each fixed $g \in G$). We shall
say that the vector-valued function $X(g)$ is Haar-measurable if each of the complex
valued functions $X_j(g)$ is Haar-measurable. It is clear that this definition is
independent of the particular complete orthonormal system in $\mathfrak{h}$, and that it follows that the
inner product $(X(g), Y(g))$ of any two Haar-measurable vector valued functions $X(g)$
and $Y(g)$ is Haar-measurable.




Clearly $U(y)$ is a unitary operator of the space $\mathfrak{H}$ onto itself, and the mapping
$U: y \to U(y)$ defines a continuous unitary representation of $G$ in the space $\mathfrak{H}$.
We call $U$ the induced representation generated by $u$, and write

(1.4)    $U = \operatorname{ind}_{K \uparrow G} u$  or  $U = \operatorname{ind} u$.

For finite groups this is the same as the classical definition of "induced
representation." For an arbitrary locally compact group and a closed subgroup
a definition has recently been given by Mackey [5]. It is easy to see
that Mackey's definition reduces to the above in the case when the subgroup
$K$ is compact, which we shall assume throughout.
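
The three properties claimed for $U$ (it preserves condition (1.1), it is multiplicative, and it is isometric) can each be checked in one line. The following is a sketch of those checks; the names $y_1, y_2$ for two group elements are introduced here only for illustration, and the last line uses the left invariance of Haar measure.

```latex
% U(y) maps functions satisfying (1.1) to functions satisfying (1.1):
\[
  (U(y)X)(gk) = X(y^{-1}gk) = u(k^{-1})\,X(y^{-1}g) = u(k^{-1})\,(U(y)X)(g).
\]
% U is multiplicative:
\[
  (U(y_1)U(y_2)X)(g) = (U(y_2)X)(y_1^{-1}g) = X(y_2^{-1}y_1^{-1}g)
  = X((y_1y_2)^{-1}g) = (U(y_1y_2)X)(g).
\]
% U(y) preserves the inner product (1.2):
\[
  (U(y)X, U(y)Y) = \int_G (X(y^{-1}g), Y(y^{-1}g))\,dg
  = \int_G (X(g), Y(g))\,dg = (X, Y).
\]
```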

Suppose for the moment that $G$ is compact too. Then the representation
$U$ of $G$ can be decomposed into a discrete (i.e. ordinary) direct sum
$\sum_j \oplus\, \mu_j M_j$ of irreducible finite dimensional pairwise inequivalent
representations $M_j$ of $G$, each occurring with multiplicity $\mu_j$. The classical
Frobenius reciprocity theorem asserts in this case that

(1.5)    multiplicity of $u$ in $M_j(K) = \mu_j$,

where $M_j(K)$ denotes the restriction of the representation $M_j$ to the subgroup
$K$.

The problem arises whether this theorem can be generalized to the case
where $G$ is no longer compact. It is known that $U$ can still be decomposed
into irreducible unitary representations $M_j$ in the sense of generalized direct
sums (= direct integrals). Therefore equation (1.5) can still be formulated.
However² there exist infinite discrete groups for which (1.5) is false even
when one takes for $K$ the trivial subgroup = (1).

But one can replace (1.5) by the following formulation, which is equivalent
to (1.5) in the classical case ($G$ compact). Denote by $U_j$ the
repetition $\mu_j$ times of $M_j$:

         $U_j = M_j \oplus \cdots \oplus M_j.$

Then we have $U = \sum_j \oplus\, U_j$, where $U_i$ and $U_j$ are inequivalent for $i \ne j$. Denote
by $W_j$ the algebra generated by the operators $U_j(g)$ in the representation
space $\mathfrak{H}_j$ of $U_j$, and by $W'_j$ the commuting algebra of $W_j$ in $\mathfrak{H}_j$. Then the
classical theory of linear algebras tells us that

² G. W. Mackey has found an explicit decomposition of the regular representation
of certain discrete groups for which every irreducible component representation occurs
with multiplicity one, and is infinite dimensional (oral communication of an
unpublished result).


(1.6)    $\dim (W'_j) = \mu_j^2,$

where $\dim (W'_j)$ = dimension of $W'_j$ as a linear space over the complex
numbers. Then we have

(1.7)    multiplicity of $u$ in $U_j(K) = \dim (W'_j)$.
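
The step from (1.5) and (1.6) to (1.7) is a simple count, sketched below under the reading that $\mu_j$ denotes the multiplicity with which $M_j$ occurs in $U$ (so that $U_j$ consists of $\mu_j$ copies of $M_j$).

```latex
% Restricting U_j to K gives \mu_j copies of M_j(K); by (1.5) each copy
% contains u exactly \mu_j times; by (1.6) the product is dim(W'_j).
\[
  U_j(K) = \underbrace{M_j(K) \oplus \cdots \oplus M_j(K)}_{\mu_j \text{ times}}
  \;\Longrightarrow\;
  \text{multiplicity of } u \text{ in } U_j(K)
  = \mu_j \cdot \mu_j = \mu_j^{2} = \dim (W'_j).
\]
```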


Moreover it is well known that if $E_1, E_2, \dots$ denote the minimal
self-adjoint idempotents in the center $Z$ of $W$, then $W_j = E_j W$, and $W'_j = E_j W'$.
This suggests another possibility of generalization, namely the use of von
Neumann's central decomposition. And, in fact, form (1.7) of the Frobenius
reciprocity theorem can be generalized as follows. Assume now that $G$ is an
arbitrary locally compact group whose Haar measure is both left and right
invariant, and which satisfies the second axiom of countability. Then the above
Hilbert space $\mathfrak{H}$ is separable, so we can apply Theorem VII of [15]. Indeed,
let $W$ be the weakly closed self-adjoint algebra of bounded linear operators
generated by the operators $U(g)$ in $\mathfrak{H}$, $W'$ the commuting algebra of $W$, and
$Z$ the center, i.e. $Z = W \cap W'$. Then we obtain a direct integral decomposition

(1.8)    $\mathfrak{H} = \int_{\oplus} \mathfrak{H}_t$

under the operators $U(g)$ to which the center $Z$ "belongs" in the sense of
Theorems IV and VII of [15]. For each $g \in G$ we obtain an operator-valued
function $U(g,t)$ of $t$ which can be changed arbitrarily on $t$-sets of measure
zero. It has been proved in Theorem 1.1 of [8] that one can find for each
$g \in G$ one such operator-valued function $U(g,t)$, and for each $t$ a continuous
unitary representation $U_t: g \to U_t(g)$ of $G$ in the space $\mathfrak{H}_t$ such that

(1.9)    $U(g,t) = U_t(g)$ for $g \notin N_t$,

where $N_t$ is some subset of $G$ of Haar measure zero, depending on $t$. Then
we have


THEOREM 1. Let $G$ be a locally compact unimodular group, satisfying
the second axiom of countability. Let $K$ be an arbitrary compact subgroup
of $G$ and $u: k \to u(k)$ a continuous irreducible (unitary) representation of
the subgroup $K$. Perform the central decomposition (1.8) of the space $\mathfrak{H}$
of the induced representation $U = \operatorname{ind} u$. Denote by $U_t(K)$ the restriction
of the representation $U_t$ of $G$ to the subgroup $K$, and by $[U_t(g)]'$ the algebra
of all those bounded operators in $\mathfrak{H}_t$ which commute with $U_t(g)$ for
every $g \in G$.




ASSERTION.
(1.10)   Multiplicity of $u$ in $U_t(K) = \dim \{[U_t(g)]'\}$ for almost every $t$,

where "dim" denotes the ordinary dimension of a linear space over the
complex numbers.


This theorem will be proved in 2 to 5. Some of the lemmas proved in 
2 to 5 may also be of interest independently of the proof of Theorem 1, 
There is some overlap between Theorem 1 above and the generalizations of 
the Frobenius reciprocity theorem recently obtained by Mackey [6]. The 
restrictions under which we prove Theorem 1 above are different from the 
assumption under which Mackey’s results hold. 

This suggests the possibility of a more general result which should 
contain all the known generalizations of the Frobenius reciprocity theorem. 
We shall not discuss this further generalization, which seems to present serious 
difficulties, in the present paper. However Theorem 1 in its present form 
has various applications. The simplest application is to the case where the 
commuting algebra W’ of a given induced representation ind wu is known to 


almost every 4; hence by Theorem 1 the multiplicity of wu in U;(K) equals 1 
for almost every ¢. This result is the main step in the derivation of the 
Plancherel formula outlined in [1la]. The details of this derivation were 
included in the original version of this paper, but have been separated from 
the rest of the paper at the suggestion of the referee. 

Theorem 1 can also be applied to the case where G is a semi-simple Lie 
Group, which for simplicity of statement we take to be its own adjoint group, 
and K a maximal compact subgroup of G. The results of Harish-Chandra 
together with Theorem 1 above imply in this case that the above commuting 
algebra W’ is a finite module over its center for every irreducible continuous 
representation u of K. This implies that the methods and results outlined 
in [11a] apply to the representation space § of ind wu in this case for every wu. 
Thus one obtains by very general considerations a rather special kind of 
Plancherel formula for each §, and hence also for the space 2.(G) of all 
complex valued Haar-Lebesgue square integrable functions on @. 

We shall use in this paper essentially the same terminology and notation 
as in [7] and [8]. In particular Hilbert spaces are always assumed to admit 
complex scalars and will be denoted by §, § or :, and occasionally also by 
H or H;. Whenever we use the results of [15] in an essential manner the 
Hilbert spaces in question have to be assumed to be separable (they may be 


be commutative. For W’ commutative implies dim {[;(g) ]’} =1 for 3 




INDUCED REPRESENTATIONS. 741 


finite dimensional). For the notions and properties of generalized direct 


sums $\mathfrak{H} = \int_\oplus \mathfrak{H}_t$ we refer the reader to [15]. If we are given such a
generalized direct sum (= direct integral), then there corresponds to every
$x \in \mathfrak{H}$ a vector valued function $x(t)$ such that the value of $x(t)$ is for a given $t$
an element of $\mathfrak{H}_t$. We call $x(t)$ the “component” of $x$ in the space $\mathfrak{H}_t$ and
say also that “$x$ decomposes into $x(t)$.” To define the direct integrals the
points $t$ must form a measure space. Our assertions about a given direct
integral will always be about “almost all $t$” or “almost all the spaces $\mathfrak{H}_t$” etc.,
by which will be meant “all $t$ (or $\mathfrak{H}_t$ etc.) except for a set of elements $t$
whose measure (with which the given direct integral is formed) is zero.”
I.e. the measure referred to in connection with a given direct integral will
always be the particular measure used to define the given direct integral,
even when not mentioned explicitly.

It has been well known for some time that it is possible to introduce a 
topology on a measure space. It seems unlikely that the introduction of such 
a topology into the measure space used for the given direct integral will 
make it possible to eliminate the “almost all” statement from most of the 
deeper results on generalized direct sums. Thus we shall not introduce 
the above mentioned topology in the present paper, but base our assertions 
and proofs about direct integrals on [7], [8] and [15]. It is however clear 
that in any particular case where a topology is really wanted, our results and 
methods can readily be translated. This remark applies in particular to the 
results outlined in [11a]. There it seems of interest to consider the measure 
space in question more closely. We plan to come back to this in a later 
publication, where it will be shown that the methods of the present paper 
and of [11a] lead to more precise results especially for semi-simple Lie groups 
and throw some new light also on the problem of eigenfunction expansions 
for certain partial differential equations (both of the elliptic and hyperbolic
type) especially when there is a continuous spectrum. 


2. Isomorphisms of factors. In this section let $\mathfrak{H}$ be an arbitrary
Hilbert space over the complex numbers, and $M$ a factor in $\mathfrak{H}$; i.e. $M$ is a
weakly closed self-adjoint algebra of bounded operators in $\mathfrak{H}$ whose center
consists of the scalar multiples of the identity operator $I$. According to [12]
there exists on $M$ an essentially unique relative dimension function $d(E)$
defined for all projections $E \in M$. According to a result of Rickart (Cor. 4.13
of [17]) the projections $E \in M$ with $d(E) < \infty$ are contained in every proper
two-sided ideal of $M$. There exists therefore a two-sided ideal $J$ of $M$ containing
all projections of finite relative dimension such that every other non-zero
two-sided ideal of $M$ contains $J$. The following lemma is only stated
explicitly for the convenience of the reader and is not essentially new.


742 F. I. MAUTNER.


LEMMA 2.1. $M$ is the smallest weakly closed self-adjoint operator algebra
which contains the identity operator $I$ and the ideal $J$.


Proof. If $M$ is of finite type in the sense of [12], then it has no proper
two-sided ideals; hence $J = M$ in this case, and hence our Lemma 2.1 is
(trivially) true in this case.


If $M$ is a factor of infinite type, then Theorem VIII of [12] implies
that there exists for every integer $n \ge 0$ a projection $E_n \in M$ with $d(E_n) = n$.
It follows from Lemma 8.13 of [12] that we can assume $E_n \le E_{n+1}$. Therefore
the $E_n$ converge strongly to a projection $E \in M$. Since $d(E - E_n) \ge 0$,
we have $d(E) = d(E - E_n) + n$ for all $n$; thus $d(E) = \infty$. Now
let $F$ be an arbitrary projection $\in M$ with $d(F) = \infty$. By Lemma 8.13 of [12]
$E$ and $F$ are equivalent in the sense that there exists a partially isometric
operator $x \in M$ such that $E = xx^*$ and $F = x^*x$, whence $x^*Ex = F$. Put
$F_n = x^*E_nx$. Then strong $\lim_n F_n = F$ and $d(F_n) = d(E_n)$. So we have
found for every projection $F \in M$ with $d(F) = \infty$ an ascending sequence of
projections $\in J$ converging strongly to $F$. This together with the fact that
$M$ is generated (as a weakly or strongly closed operator algebra) by its
projections (cf. [16]) proves Lemma 2.1.

As an immediate consequence we obtain 


LEMMA 2.2. Let $M$ be a factor and $C$ an arbitrary idempotent element
of $M'$, i.e. $C^2 = C$ is a bounded linear operator, and $CA = AC$ for all $A \in M$.
Then the mapping $A \to CA$ is an isomorphism of $M$ onto the algebra $CM$,
provided $C \neq 0$.


Proof. The mapping $A \to CA$ is clearly a homomorphism of the algebra
$M$ since $C^2 = C$ and $AC = CA$. Hence the set of elements $A$ of $M$ for
which $AC = 0$ is a two-sided ideal of $M$. If it were not the zero ideal it
would have to contain $J$, by the above mentioned result of Rickart. But
then Lemma 2.1 above would imply $AC = 0$ for all $A \in M$, which contradicts
$IC = C \neq 0$, since the identity operator $I$ is an element of $M$.
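
A finite-dimensional sanity check of Lemma 2.2 can be sketched as follows. This is an illustration only, under assumptions that are not in the text: $M$ is taken to be the type I factor of all operators $A \otimes I$ on $\mathbb{C}^2 \otimes \mathbb{C}^2$, the idempotent is $C = I \otimes P$ with a non-self-adjoint $P$, and the helper names (`kron`, `embed`, `image`, `recover`) are hypothetical.

```python
def matmul(a, b):
    """Product of two matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def kron(a, b):
    """Kronecker product a (x) b of two square matrices."""
    na, nb = len(a), len(b)
    return [[a[i][j] * b[k][l] for j in range(na) for l in range(nb)]
            for i in range(na) for k in range(nb)]

I2 = [[1, 0], [0, 1]]
P = [[1, 1], [0, 0]]      # assumed example: P*P == P, but P is not self-adjoint
C = kron(I2, P)           # an idempotent in the commutant M' = {I (x) B}

def embed(a):
    """The element A (x) I of the factor M, for a 2x2 matrix A."""
    return kron(a, I2)

def image(a):
    """The map A -> CA of Lemma 2.2, applied to embed(a)."""
    return matmul(C, embed(a))

def recover(ca):
    """Inverse of A -> CA on its range: CA = A (x) P and P[0][0] == 1."""
    return [[ca[2 * i][2 * j] for j in range(2)] for i in range(2)]
```

Since `recover(image(a))` returns `a`, the map has trivial kernel in this example, which is the content of the lemma; the general proof of course needs the ideal structure of $M$ and not an explicit inverse.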


3. Decomposition of an invariant subspace. In this section let H be 
an arbitrary (separable) Hilbert space and W an arbitrary weakly closed 
self-adjoint algebra of bounded operators in H. Perform the central decom- 
position (Theorem VII of [15]) under the algebra W: 




(3.1) $H = \int_\oplus H_t$.


Now let $E = E^* = E^2 \in W'$, where $W'$ denotes as usual the commuting algebra
of $W$. By Theorem V of [15] $E \in W'$ implies that $E$ is decomposable under
(3.1) into an operator-valued function, say $E(t)$. Let $T_1$ be the set of those
$t$ for which $E(t) \neq 0(t)$, where $0(t)$ denotes the zero-operator in the space
$H_t$. Put $H_1 = EH$ and $H_{1t} = E(t)H_t$ for $t \in T_1$. Clearly for any $x \in H$
we have $x \in H_1$ if and only if


(3.2) $x(t) = 0(t)$ for $t \notin T_1$ and $x(t) \in H_{1t}$ for $t \in T_1$


(after a possible change of $x(t)$ on a $t$-set of measure zero which we assume
to have been made). We have therefore a one-one correspondence between
the elements $x$ of $H_1$ and those equivalence classes of vector valued functions
$x(t)$ which occur in (3.1) and satisfy the conditions (3.2). It is easy to
conclude from this that this one-one correspondence defines a direct integral
decomposition


(3.3) $H_1 = \int_\oplus H_{1t}$,


where $t$ now ranges only over the set $T_1$ and the measure used in the definition
of (3.3) is the restriction of the measure used in the definition of (3.1) to
the subset $T_1$.

Denote by $Z$ the center of $W$ ($Z = W \cap W'$), by $W_1$ the algebra
$EW$, and by $Z_1$ the algebra $EZ$, where the elements of $W_1$ and $Z_1$ are
considered to be operators of the space $H_1$ into itself. We then have


LEMMA 3.1. The direct integral (3.3) is the central decomposition of
the space $H_1$ under the algebra $W_1$, i.e. the algebra $Z_1$ “belongs to it” in the
sense of [15]. In particular $Z_1$ is the center of $W_1$.


Proof. Let $A_1 \in W_1$; then there exists an element $A$ of $W$ such that $A_1$
is the restriction of $EA$ to $H_1$. Since $A \in W$, it is decomposable under (3.1)
into an operator-valued function $A(t)$ say. Moreover $(EA)(t) = E(t)A(t)$
for almost all $t$ implies


(3.4) $E(t)A(t)H_{1t} \subset H_{1t}$


for almost all $t$. Now change $A(t)$ on a set of measure zero so that (3.4)
becomes valid for all $t$ and put


(3.5) $A_1(t)$ = restriction of $E(t)A(t)$ to $H_{1t}$ for $t \in T_1$.
Then $A_1$ decomposes into $A_1(t)$ under (3.3).




Let us now choose countably many elements $A^{(j)}$ ($j = 1, 2, \dots$) of $W$
which generate $W$ (in the weak or strong topology). It follows from the
proof of Theorem VI (on p. 459) of [15] that we may assume $W(t)$ to be
generated by the elements $A^{(j)}(t)$ for every $t$, where we can also assume that
the $A^{(j)}(t)$ are chosen such that (3.4) holds for every $t$ and also


(3.6) $A^{(j)}(t)E(t) = E(t)A^{(j)}(t)$ for every $t$ and all $j = 1, 2, 3, \dots$.


Now let $W_1(t)$ be the (weakly closed self-adjoint) algebra generated by the
operators $A_1^{(j)}(t)$ for $t \in T_1$. Then it is clear that since $E(t) \in W'(t) = W(t)'$
(cf. Lemma 13 of [15]), the mapping


(3.7) $A(t) \to A_1(t)$ = restriction of $E(t)A(t)$ to $H_{1t}$


is a homomorphism of $W(t)$ onto $W_1(t)$ for almost every $t \in T_1$. Since
$E(t) \neq 0(t)$ for $t \in T_1$, and $W(t)$ is a factor for every $t$, we may apply
Lemma 2.2 above and conclude that the mapping (3.7) is an isomorphism
of $W(t)$ onto $W_1(t)$ for almost every $t \in T_1$. Hence in particular $W_1(t)$ is
also a factor for almost every $t \in T_1$.

Note that it follows from the way $W_1(t)$ is defined that if $X$ is an
arbitrary element of $W_1$, i.e. $X = AE = EA$ with $A \in W$, then $X(t) = A_1(t)$
as above is an element of $W_1(t)$ for almost every $t \in T_1$. Observe also that
$W_1(t)$ depends measurably on $t$ for $t \in T_1$ in the sense of Definition 5 of [15]
under the direct integral (3.3). Hence in order to prove that (3.3) is the
central decomposition under $W_1$ it is by Theorem VII of [15] sufficient to
prove the following: If $Y(t)$ is an arbitrary bounded measurable operator
valued function (in the sense of Definition 5 of [15]) defined for all $t \in T_1$,
and satisfies $Y(t) \in W_1(t)$, then there exists an operator $Y$ of the space $H_1$
which is an element of $W_1$ and decomposes into $Y(t)$ under (3.3); i.e. we
have to prove $W_1 \sim \sum W_1(t)$ in the terminology of [15] under the direct
integral (3.3).

By Lemma 12 of [15] the ring generated by $P_1$ and the operators $A_1^{(j)}$
satisfies $\sim \sum W_1(t)$, where $P_1$ denotes the ring of those operators of $H_1$
which decompose into scalars under (3.3). Since the $A_1^{(j)}$ generate $W_1$ it
remains to prove $P_1 \subset W_1$.

$C_1 \in P_1$ means that there exists a complex valued bounded measurable
function $c(t)$ defined for $t \in T_1$, such that $C_1$ decomposes under (3.3) into
$C_1(t) = c(t)I_1(t)$, where $I_1(t)$ denotes the identity operator in $H_{1t}$. Let
$C(t)$ denote $c(t)I(t)$ or $0(t)$ according as $t$ is or is not in $T_1$. Then


(3.8) $C_1(t)$ = restriction of $C(t)E(t)$ to $H_{1t}$ for $t \in T_1$,




because $I_1(t)$ = restriction of $E(t)$ to $H_{1t}$ for $t \in T_1$. Since $T_1$ is a measurable
set and $c(t)$ a measurable function, the operator-valued function $C(t)$ depends
measurably on $t$. Hence there exists an operator $C$ in $H$ which decomposes
into $C(t)$ under (3.1). Since $C(t)$ is a scalar for every $t$, and since (3.1)
belongs by hypothesis to $Z$ (which means that $Z$ is exactly the ring of those
operators in $H$ which decompose into scalars under (3.1)), we have $C \in Z$.
But $Z \subset W$, and hence $C \in W$. Also (3.8) implies that $C_1$ is the restriction
of $CE$ to $H_1$. This proves $C_1 \in W_1$, i.e. $P_1 \subset W_1$. So we have proved
$W_1 \sim \sum W_1(t)$ in the terminology of [15]. As remarked above, this fact
proves that (3.3) is the central decomposition of $W_1$. This proves the first
assertion of Lemma 3.1.

But the fact (which we have just proved) that (3.3) is the central
decomposition of $H_1$ under $W_1$ implies that $P_1$ is the center of $W_1$. On the
other hand we have just seen that (3.8) implies that the elements $C_1$ of $P_1$
are exactly the elements of the form “restriction of $CE$ to $H_1$” where $C$
varies over $Z$, i.e. the elements of $Z_1$. This proves $Z_1 = P_1$; hence $Z_1$ is the
center of $W_1$. This proves the second assertion of Lemma 3.1 and hence the
proof of Lemma 3.1 is complete.

Later on we shall also require 


LEMMA 3.2. Let again $E = E^* = E^2$ and $E \in W'$. Let $C_E$ be the smallest
projection $\in Z = W \cap W'$ which satisfies $C_E \supseteq E$. Then the restriction of
the homomorphism $X \to XE$ to the subalgebra $WC_E$ of $W$ is an isomorphism
of $WC_E$ onto $WE$. The kernel of the homomorphism $X \to XE$ is $W(I - C_E)$.


Proof. By hypothesis $C_E \supseteq E$, i.e. $C_EE = E$; hence $WE = (WC_E)E$,
which proves that the onto-assertion of the lemma is trivial. In order to prove
that $WC_E$ is mapped isomorphically under the homomorphism


(3.9) $X \to XE$,


it is clearly sufficient to prove that $W(I - C_E)$ is the kernel of (3.9). Let
$C_E(t)$ be the operator valued function into which $C_E$ decomposes under (3.1).
Since $C_E \in Z$, and since $Z$ belongs to (3.1), we have as in the proof of
Lemma 3.1 $C_E(t) = c(t)I(t)$, where $c(t)$ is a numerical essentially bounded
measurable function defined for all $t$. Since $C_E$ is a projection, we must
have $c(t) = 1$ or $0$ for almost all $t$. But $C_EE = E$ implies $C_E(t)E(t) = E(t)$
for almost all $t$. Since $E(t) \neq 0(t)$ for $t \in T_1$, we must have $c(t) = 1$ for
almost all $t \in T_1$. Since $E(t) = 0(t)$ for $t \notin T_1$, we must have $c(t) = 0$ for
almost all $t \notin T_1$, for otherwise $C_E$ would not be minimal among the central
projections $\supseteq E$. This proves


(3.10) $C_E(t) = I(t)$ for almost all $t \in T_1$, and $C_E(t) = 0(t)$ for almost all $t \notin T_1$.


Now suppose $X$ is in the kernel of the homomorphism (3.9), i.e. suppose
$XE = 0$. This means $X(t)E(t) = 0(t)$ for almost all $t$ under the
decomposition (3.1). If $t \in T_1$ then Lemma 2.2 implies for almost all $t$ that
$X(t)E(t) = 0(t)$ if and only if $X(t) = 0(t)$. If on the other hand $t \notin T_1$,
then $X(t)E(t) = 0(t)$ for any $X \in W$. This proves that $XE = 0$ if and
only if $X(t) = 0(t)$ for almost all $t \in T_1$. But by (3.10) this last statement
is the same as $XC_E = 0$, i.e. $X(I - C_E) = X$, which proves that the kernel
of the homomorphism (3.9) is exactly $W(I - C_E)$.

In the course of the proof of Lemma 3.2 we obtained the following


COROLLARY 3.1. Let $C_E$ be the smallest projection $\in Z$ which satisfies
$C_E \supseteq E$. Let $C_E(t)$ be the operator valued function into which $C_E$
decomposes under the central decomposition (3.1). Then $C_E(t)$ is given for almost
all $t$ by (3.10), where $T_1$ is again the set of those $t$ for which $E(t) \neq 0(t)$.
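
For orientation, Lemma 3.2 and Corollary 3.1 can be checked by hand in a finite-dimensional situation. The following worked example is an illustration only; the two-point measure space $\{\alpha, \beta\}$ (each point of mass one) and the particular algebras are assumptions chosen for simplicity and are not taken from the text.

```latex
\begin{align*}
H &= H_\alpha \oplus H_\beta, \qquad
     H_\alpha = \mathbb{C}^2 \otimes \mathbb{C}^2, \qquad H_\beta = \mathbb{C}^3, \\
W &= (M_2(\mathbb{C}) \otimes 1) \oplus M_3(\mathbb{C}), \qquad
W' = (1 \otimes M_2(\mathbb{C})) \oplus \mathbb{C}\,1, \qquad
Z = \mathbb{C}\,1_{H_\alpha} \oplus \mathbb{C}\,1_{H_\beta}, \\
E &= (1 \otimes e_{11}) \oplus 0 \in W', \qquad
T_1 = \{\alpha\}, \qquad
C_E = 1_{H_\alpha} \oplus 0, \\
WC_E &= (M_2(\mathbb{C}) \otimes 1) \oplus 0, \qquad
WE = (M_2(\mathbb{C}) \otimes e_{11}) \oplus 0, \qquad
(A \otimes 1) \oplus 0 \;\mapsto\; (A \otimes e_{11}) \oplus 0, \\
\ker(X \mapsto XE) &= 0 \oplus M_3(\mathbb{C}) = W(I - C_E).
\end{align*}
```

Here $C_E(t)$ equals $I(t)$ at $t = \alpha$ and $0(t)$ at $t = \beta$, in agreement with (3.10), and the map $A \otimes 1 \mapsto A \otimes e_{11}$ is visibly one-one.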


4. The central decomposition of the regular representation of G. In
this section we consider the central decomposition of the regular representation
of our separable locally compact unimodular group G. Denote by $R$ the
weakly closed self-adjoint operator algebra generated by the right translations
$R(g)$ in $\mathfrak{L}_2(G)$, and by $L$ the algebra generated by the left translations $L(g)$.
Godement and Segal have shown that $L$ and $R$ are each other's commuting
algebras:

(4.1) $L = R'$ and $R = L'$.


Denote by $Z$ the center of $R$: $Z = R \cap R' = R \cap L = L \cap L'$, and let


(4.2) $\mathfrak{L}_2(G) = \int_\oplus \mathfrak{H}_t$


be the central decomposition of $\mathfrak{L}_2(G)$ under $R$ (or $L$).
If $a(g)$ is an arbitrary complex valued Haar-Lebesgue-integrable function
on G, let $R_a$ be the operator acting on $\mathfrak{L}_2(G)$ defined by


(4.3) $R_a = \int a(g)R(g)\,dg$.


Clearly $R_a$ is an element of $R$. Denote by $R^{(1)}$ the subset of $R$ of the elements
$R_a$ obtained from integrable functions $a(g)$, and by $R^{(1,2)}$ the subset of those
$R_a \in R^{(1)}$ for which $a(g)$ is also square integrable. Under the decomposition
(4.2) there corresponds to the operator $R_a$ an operator-valued function, say



$A(t)$, and to the element $a(g)$ of $\mathfrak{L}_2(G)$ a vector valued function $a(t)$,
whenever $a(g) \in \mathfrak{L}_1(G) \cap \mathfrak{L}_2(G)$. Again $A(t)$ and $a(t)$ can be changed arbitrarily
on sets of measure zero. However we have


Lemma 4.1. Let $a(g) \in \mathfrak{L}_1(G) \cap \mathfrak{L}_2(G)$. Then there exists one choice
which assigns to every operator $R_a$ a unique operator valued function $A(t)$
and to the “vector” $a(g)$ a unique vector valued function $a(t)$ such that
for this choice the mapping


(4.4) $A(t) \to a(t)$


is for almost every $t$ a one-one linear mapping of a certain weakly dense
linear subspace $P(t)$ of the operator algebra $R(t)$ defined below, onto a dense
linear subspace of $\mathfrak{H}_t$.


Proof. It is clear that $R$ is generated by $R^{(1,2)}$. It follows from p. 386
of [16] that there exists a sequence of functions $a_j(g) \in \mathfrak{L}_1(G) \cap \mathfrak{L}_2(G)$
($j = 1, 2, \dots$) which are dense in $\mathfrak{L}_2(G)$ and for which the corresponding
operators


(4.4a) $R_{a_j}$ ($j = 1, 2, \dots$)


generate the ring $R$ as the smallest weakly closed self-adjoint operator algebra
containing the operators (4.4a). Now choose for each operator in the
sequence (4.4a) one operator valued function $A_j(t)$ into which it decomposes
under our given direct integral. Then it follows from the proof of Theorem
VI (on p. 459) of [15] that after one possible change on a set of measure
zero the factors $R(t)$ into which $R$ decomposes may be assumed to be generated
by the operators $A_1(t), A_2(t), \dots$. Now let $A$ be a (not necessarily
commutative) polynomial

(4.4b) $p(R_{a_{j_1}}, R_{a_{j_2}}, \dots)$


in a finite number of the operators (4.4a), and define $A(t)$ to be equal to


(4.4c) $p(A_{j_1}(t), A_{j_2}(t), \dots)$.


Denote the family of operators so obtained for each $t$ by $P(t)$. Then $P(t)$
is a dense linear subspace of $R(t)$ in the weak topology for operators.


Now consider all monomials in some of the operators (4.4a). Since to
the product $R_aR_{a'}$ corresponds the convolution of the functions $a(g)$ and $a'(g)$,
i.e. $R_aR_{a'} = R_{a''}$ where $a''(g) = \int a(gy^{-1})a'(y)\,dy$ and where $a'' \in \mathfrak{L}_1(G)$
whenever $a$ and $a' \in \mathfrak{L}_1(G)$, it follows (for instance from Lemma 7.1 of [8])
that $a \in \mathfrak{L}_1(G) \cap \mathfrak{L}_2(G)$ and $a' \in \mathfrak{L}_1(G) \cap \mathfrak{L}_2(G)$ imply $a'' \in \mathfrak{L}_1(G) \cap \mathfrak{L}_2(G)$.
Hence there exists a sequence of functions $b_m(g)$ ($m = 1, 2, \dots$), each
$\in \mathfrak{L}_1(G) \cap \mathfrak{L}_2(G)$, such that each of the above monomials is equal to one of the
operators $R_{b_m}$.
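
The correspondence between operator products and convolutions used above can be checked numerically on a finite group, where the Haar integral becomes a sum. The sketch below is an illustration under assumptions not in the text: the group is taken to be the symmetric group $S_3$, functions on it are stored as dictionaries, and the names `R_op` and `convolve` are hypothetical.

```python
from itertools import permutations

G = list(permutations(range(3)))   # the six elements of S_3 (assumed example group)

def mul(p, q):
    """Group product pq (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(3))

def inv(p):
    """Inverse permutation."""
    r = [0, 0, 0]
    for i, pi in enumerate(p):
        r[pi] = i
    return tuple(r)

def R_op(a):
    """Matrix of R_a = sum_g a(g) R(g), where (R(g) f)(x) = f(xg)."""
    return [[sum(a[g] for g in G if mul(x, g) == y) for y in G] for x in G]

def convolve(a, b):
    """a''(g) = sum_y a(g y^{-1}) b(y), the discrete analogue of the integral."""
    return {g: sum(a[mul(g, inv(y))] * b[y] for y in G) for g in G}

def matmul(m, n):
    """Product of two matrices given as nested lists."""
    return [[sum(m[i][k] * n[k][j] for k in range(len(n)))
             for j in range(len(n[0]))] for i in range(len(m))]

# Two arbitrary integer-valued test functions on the group.
a = {g: i + 1 for i, g in enumerate(G)}
b = {g: (i * i) % 5 - 2 for i, g in enumerate(G)}
```

With these definitions `matmul(R_op(a), R_op(b))` agrees entry by entry with `R_op(convolve(a, b))`, which is the finite analogue of $R_aR_{a'} = R_{a''}$.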

Clearly the functions $b_m(g)$ are dense in $\mathfrak{L}_2(G)$ since the $a_j(g)$ are
contained among the $b_m(g)$. Now choose for each $b_m(g)$ one vector valued
function $b_m(t)$ into which it decomposes under (4.2). It is proved in §6
of [15] that the finite linear combinations of the $b_m(t)$ form a dense linear
subspace of $\mathfrak{H}_t$ for almost every $t$ and hence, after one change on a set of
measure zero, for all $t$. Now let $x(g)$ be an arbitrary finite linear combination
of the $b_m(g)$ with complex coefficients $c_m$:


(4.5) $x(g) = \sum_{m=1}^{r} c_m b_m(g)$ ($r$ arbitrary $< \infty$).


Define the vector valued function $x(t)$ by


(4.6) $x(t) = \sum_{m=1}^{r} c_m b_m(t)$;


then $x(g)$ decomposes into $x(t)$ under (4.2). As remarked above the $x(t)$
form for each fixed $t$ a dense linear subspace of $\mathfrak{H}_t$, and the operators $X(t)$
defined by


(4.7) $X(t) = \sum_{m=1}^{r} c_m B_m(t)$


are exactly the polynomials $p(A_{j_1}(t), A_{j_2}(t), \dots)$ defined above. Hence the
$X(t)$ form the dense subalgebra $P(t)$ of $R(t)$. Therefore the proof of
Lemma 4.1 will be complete if we prove the following:


The correspondence $X(t) \leftrightarrow x(t)$ between the elements (4.7) of $P(t)$
and the elements (4.6) of $\mathfrak{H}_t$ is a one-one linear mapping for almost every $t$.
Denote by $T_t$ the (essentially unique) relative trace which exists for certain
elements of the factor $R(t)$ in accordance with [13]. According to Lemma 7.4
of [8] the factor $R(t)$ will be of type I or II for every $t$ (after a possible
change on one set of measure zero which we assume to have been made).
Then there exists a certain function $a(t) > 0$ and $\le \infty$ such that


(4.8) $\int x(g)\overline{y(g)}\,dg = \int a(t)T_t(X(t)Y(t)^*)\,ds(t)$,


where $ds(t)$ refers to the measure used to define the direct integral (4.2).


On the other hand $\int x(g)\overline{y(g)}\,dg = \int (x(t), y(t))\,ds(t)$. Now let $C$ be any
element of $R$; it decomposes into a certain operator valued function, say $C(t)$,
under (4.2), and we obtain

(4.9) $\int (C(t)x(t), y(t))\,ds(t) = \int a(t)T_t(C(t)X(t)Y(t)^*)\,ds(t)$.




In accordance with Theorem IV and VII of [15] we obtain, as $C$ ranges over
the center $Z$ of $R$, for the functions $C(t)$ all essentially bounded scalar valued
functions $c(t)I(t)$. Hence for $C \in Z$ we obtain


$\int c(t)(x(t), y(t))\,ds(t) = \int c(t)a(t)T_t(X(t)Y(t)^*)\,ds(t)$.
In this equation $c(t)$ can be the characteristic function of an arbitrary
$s$-measurable set, which proves
(4.10) $(x(t), y(t)) = a(t)T_t(X(t)Y(t)^*)$
for all $t$ outside of some set of measure zero which may depend on $x(g)$ and
$y(g)$. However if we put $b_m(g)$ for $x(g)$ and $b_n(g)$ for $y(g)$, we get
from (4.10)
(4.11) $(b_m(t), b_n(t)) = a(t)T_t(B_m(t)B_n(t)^*)$,
for all $t$ outside of one set of measure zero which is obtained by taking the
union of the countably many sets corresponding to each pair $m$, $n$ of integers.
Hence if $x(g)$ and $y(g)$ are finite linear combinations of the functions $b_m(g)$,
then we can define $X(t)$ and $Y(t)$ in terms of the functions $B_m(t)$ by
equation (4.7), and $x(t)$ and $y(t)$ by equation (4.6), and obtain for this
particular choice of the functions $x(t)$, $y(t)$, $X(t)$, $Y(t)$ the truth of (4.10)
for all $t$ outside of one set of measure zero.

It is proved in Theorem VIII of [15] that the above function $a(t)$
(denoted there by $a(\lambda)$) is positive for all $t$. If we can prove that $a(t)$
is finite for almost all $t$ then we may consider the right side of equation
(4.10) above to be an inner product defined on $P(t)$. The left side of (4.10)
is the inner product of the space $\mathfrak{H}_t$. Hence $a(t) < \infty$ would imply that the
correspondence $X(t) \leftrightarrow x(t)$ preserves inner products, hence is one-one and
clearly linear for almost every $t$ (and therefore for all $t$, after a change on
one set of measure zero). Therefore the proof of Lemma 4.1 will be complete
if we prove

Lemma 4.2. The function $a(t)$ which occurs in the generalized Peter-
Weyl-Plancherel formula (4.8) is finite and positive for almost every $t$.

Proof. As remarked above, $a(t) > 0$ was proved by von Neumann ([15],
Theorem VIII). To prove $a(t) < \infty$ we observe that for $m = n$ we obtain
from (4.11)


$\|b_n(t)\|^2 = a(t)T_t(B_n(t)B_n(t)^*)$.


Since $\|b_n(t)\|^2 < \infty$, the expression $T_t(B_n(t)B_n(t)^*)$ would have to be 0
for any $t$ for which $a(t) = \infty$. But then


(4.12) $B_n(t) = 0(t)$.




The set of those $t$ for which (4.12) holds is known to be measurable (cf.
[15], §13); hence so is the set of those $t$ for which (4.12) holds for all
$n = 1, 2, \dots$. On the other hand the $B_n(t)$ generate the ring $R(t)$, and
$R(t)$ contains the identity operator $I(t)$ for almost every $t$. Hence (4.12)
can be true only on a $t$-set of measure zero. Therefore $a(t) < \infty$ for almost
every $t$ as required. This completes the proof of Lemma 4.2 and hence also
the proof of Lemma 4.1.


In the next section we shall require a special case of the following 


LEMMA 4.3. Let $\beta$ be a countably additive complex valued set function
on G. For $y(g)$, a suitable complex valued function on G, put


Assume $\beta$ is such that $R_\beta$ and $L_\beta$ are bounded linear operators on $\mathfrak{L}_2(G)$
defined for all $y(g) \in \mathfrak{L}_2(G)$ and that $R_\beta$ and $L_\beta$ are defined for all
$y(g) \in \mathfrak{L}_1(G)$ and satisfy $R_\beta\mathfrak{L}_1(G) \subset \mathfrak{L}_1(G)$ and $L_\beta\mathfrak{L}_1(G) \subset \mathfrak{L}_1(G)$. Denote
by $R_\beta(t)$, $L_\beta(t)$ the operator valued functions into which the operators $R_\beta$, $L_\beta$
respectively decompose under the central decomposition (4.2).


ASSERTION. The mapping (4.4) can be taken to be such that under it
there corresponds for almost every fixed $t$ to the operator $A(t)R_\beta(t)$ the
element $L_\beta(t)a(t)$ of $\mathfrak{H}_t$, and to the operator $R_\beta(t)A(t)$ the element $R_\beta(t)a(t)$
of $\mathfrak{H}_t$.

Proof. Observe first that the definition (4.12) of $R_\beta$, together with the
assumption that $R_\beta$ be a bounded operator, implies $R_\beta \in R$; similarly $L_\beta \in L$.
Hence $R_\beta$ and $L_\beta$ are decomposable operators under (4.2). Therefore the
above functions $R_\beta(t)$ and $L_\beta(t)$ exist.


Notice next that the definition (4.12) implies that
(4.13) $R_aR_\beta = R_{L_\beta a}$ and $R_\beta R_a = R_{R_\beta a}$.


Hence for all $t$ outside of a set of measure zero (which may depend on the
function $a(g)$ and the measure $\beta$) we obtain

(4.14) $R_a(t)R_\beta(t) = R_{L_\beta a}(t)$, and $R_\beta(t)R_a(t) = R_{R_\beta a}(t)$,

where $R_a(t), \dots$ are arbitrary operator valued functions into which the
operators $R_a, \dots$ decompose under (4.2) (arbitrary in the sense that they
may be changed arbitrarily on sets of measure zero). Now put in equations
(4.14) the function $b_n(g)$ instead of $a(g)$ and replace the “arbitrary” $R_a(t)$
by the function $A(t)$ introduced in the proof of Lemma 4.1. By taking the
union of countably many sets of measure zero we then obtain from (4.14)



$A(t)R_\beta(t) = R_{L_\beta a}(t)$, and $R_\beta(t)A(t) = R_{R_\beta a}(t)$,


for all $t$ outside of one set of measure zero whenever $a(g)$ is a finite linear
combination of the functions $b_n(g)$. By hypothesis the functions $(L_\beta a)(g)$
and $(R_\beta a)(g)$ are elements of $\mathfrak{L}_1(G) \cap \mathfrak{L}_2(G)$ whenever $a(g) \in \mathfrak{L}_1(G) \cap \mathfrak{L}_2(G)$.
Since the measure $\beta$ is fixed, we may assume that the functions $b_n(g)$ are
chosen such that $L_\beta b_n = b_{n'}$ and $R_\beta b_n = b_{n''}$ are again functions of the sequence
$b_1(g), b_2(g), \dots$. Put $(L_\beta a)(g) = {}_\beta a(g)$ and $(R_\beta a)(g) = a_\beta(g)$, and let
$A_\beta(t)$, ${}_\beta A(t)$ be the operator-valued functions of Lemma 4.1 into which $R_{R_\beta a}$,
$R_{L_\beta a}$ decompose respectively. Then we get for all $t$ outside of one set of measure
zero

(4.15) $A(t)R_\beta(t) = {}_\beta A(t)$, and $R_\beta(t)A(t) = A_\beta(t)$.

On the other hand $(L_\beta a)(g)$ decomposes into the vector valued function
$(L_\beta a)(t) = {}_\beta a(t)$ say, for almost all $t$, and $R_\beta a$ into $R_\beta(t)a(t) = a_\beta(t)$ for
almost all $t$, where ${}_\beta a(t)$ and $a_\beta(t)$ are the vector valued functions uniquely
determined above by $a(g)$ for which Lemma 4.1 is true.

Moreover, since we may assume $L_\beta b_n = b_{n'}$ and $R_\beta b_n = b_{n''}$ we see that
the operators ${}_\beta A(t)$ and $A_\beta(t)$ are elements of $P(t)$; therefore the mapping
(4.4) is defined for ${}_\beta A(t)$ and $A_\beta(t)$, and for all $t$ outside of one set of
measure zero we obtain


${}_\beta A(t) \to {}_\beta a(t)$ and $A_\beta(t) \to a_\beta(t)$,


under the mapping (4.4). Combining this with (4.15) we obtain the truth
of Lemma 4.3.


5. Completion of the proof of Theorem 1. Let us now consider the
representation $U$ = ind $u$ of G defined in §1. If the representation $u: k \to u(k)$
of K is a discrete direct sum of representations $u_\nu$, $u(k) = \sum_\nu u_\nu(k)$, then it
follows that $U$ is the direct sum of representations $U_\nu$ of G, where $U_\nu$ = ind $u_\nu$:

(5.1) ind $\{\sum_\nu u_\nu\} = \sum_\nu$ ind $u_\nu$.

Indeed if the representation space $\mathfrak{h}$ of $u$ is a direct sum $\sum_\nu \mathfrak{h}_\nu$ of invariant
subspaces $\mathfrak{h}_\nu$, then consider those functions $x(g) \in \mathfrak{H}$ for which $x(g)$ is, for
each fixed $g \in G$, an element of $\mathfrak{h}_\nu$. They form a closed invariant linear
subspace $\mathfrak{H}_\nu$ of $\mathfrak{H}$ under the operators $U(g)$, and if $U_\nu(g)$ denotes the restriction
of $U(g)$ to $\mathfrak{H}_\nu$ then the definition of induced representation as given in §1
implies that $U_\nu$ is (up to unitary equivalence) equal to ind $u_\nu$. This proves
(5.1).

In particular let $\lambda$ be the left-regular representation of K. Then




$\lambda = \sum_\nu h_\nu u_\nu$, where $u_\nu$ varies over all irreducible representations of the compact
group K (equivalent representations being identified); $u_\nu$ is not equivalent
to $u_{\nu'}$ for $\nu \neq \nu'$, $h_\nu$ denotes the degree of $u_\nu$ and $h_\nu u_\nu$ denotes the repetition
of the representation $u_\nu$, $h_\nu$ times. Hence (5.1) gives


(5.2) $\operatorname{ind}_{K \uparrow G} \lambda = \sum_\nu h_\nu \operatorname{ind}_{K \uparrow G} u_\nu$.


On the other hand the definition of induced representation implies that if
$\lambda$ is the regular representation of K (i.e. $\lambda(k)$ is left translation by the element
$k$ in the space $\mathfrak{L}_2(K)$), then ind $\lambda$ is the (left-) regular representation
of G. To see this, observe that the definition of induced representation implies
that the representation space for ind $\lambda$ can be identified with the space of
those elements $x(k, g)$ of $\mathfrak{L}_2(K) \times \mathfrak{L}_2(G)$ which satisfy


(5.3) $x(k, k'g) = x(kk', g)$.


Hence the square norm equals 


where we take the Haar measure $dk$ on K to be normalized so that $\int dk = 1$.
This proves that if $x(k, g) \in \mathfrak{H}_\lambda$ then the mapping
(5.4) $x(k, g) \to x(1, g) = x'(g)$
is a unitary mapping of $\mathfrak{H}_\lambda$ onto $\mathfrak{L}_2(G)$. Also the mapping (5.4) clearly
commutes with left translations by elements of G, which proves that ind $\lambda$ is
(up to unitary equivalence) equal to the left regular representation $L$ of G.
The reader who is worried about the fact that the functions $x(k, g)$ are, as
elements of $\mathfrak{L}_2(K) \times \mathfrak{L}_2(G)$, only defined up to sets of measure zero, may
restrict himself at first to functions $x(k, g)$ which are continuous in $k$, and
then extend the isometry (5.4) uniquely to a unitary mapping from $\mathfrak{H}_\lambda$
onto $\mathfrak{L}_2(G)$.

Thus (5.2) can be written as


(5.5) $L = \sum_\nu h_\nu U_\nu$, where $U_\nu$ = ind $u_\nu$.
Let $\chi_\nu$ be the character of $u_\nu$, i.e. $\chi_\nu(k)$ = trace of $u_\nu(k)$. Then the operator


is a projection; here p(k) denotes right-translation by the element & of K 
in the space 2,(K). The subspace A,,%.(K) of &.(K) is representation 
space for the representation h,u, of K, as is well known. 
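
The projection property claimed here can be tested exactly on a small finite group. The displayed formula for the operator is not legible in this copy, so the sketch below assumes the usual form $h \int \chi(k)\rho(k)\,dk$ with normalized Haar measure, which for a finite group becomes $(h/|K|)\sum_k \chi(k)\rho(k)$. The choice of $S_3$ and of its two-dimensional irreducible character (which is real, so the question of complex conjugation does not arise) is an assumption made for illustration, and all names are hypothetical.

```python
from fractions import Fraction
from itertools import permutations

K = list(permutations(range(3)))   # S_3 as a stand-in for the compact group K

def mul(p, q):
    """Group product pq (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(3))

def chi(p):
    """Character of the 2-dimensional irreducible representation of S_3:
    number of fixed points minus one."""
    return sum(1 for i in range(3) if p[i] == i) - 1

def rho(k):
    """Matrix of right translation (rho(k) f)(x) = f(xk) on functions on K."""
    return [[1 if mul(x, k) == y else 0 for y in K] for x in K]

def matmul(m, n):
    """Product of two matrices given as nested lists."""
    return [[sum(m[i][j] * n[j][l] for j in range(len(n)))
             for l in range(len(n[0]))] for i in range(len(m))]

h = 2                                # degree of the representation
weight = Fraction(h, len(K))         # h times the normalized Haar mass of a point
translations = {k: rho(k) for k in K}
P = [[sum(weight * chi(k) * translations[k][i][j] for k in K)
      for j in range(len(K))] for i in range(len(K))]
```

Because the arithmetic is done with `Fraction`, the identity `matmul(P, P) == P` holds exactly, and the trace of `P` equals `h * h`, matching the statement that the range carries the representation $h_\nu u_\nu$.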




Therefore the space of those elements $x(k, g)$ of $\mathfrak{L}_2(K) \times \mathfrak{L}_2(G)$ which
satisfy (5.3) and $h_\nu \int \chi_\nu(k)x(k'k^{-1}, g)\,dk = x(k', g)$ can be taken to be
representation space for ind $h_\nu u_\nu$ under right translations by the elements of G.
Under the unitary mapping (5.4) this space is mapped onto the subspace
of all those elements $x'(g)$ of $\mathfrak{L}_2(G)$ which satisfy


(5.6) hy dk = 
i.e, (Ryyt’)(g) =2’(g), if f xo(k) R(k) dk. 
This proves 


LEMMA 5.1. Let $u$ be any continuous irreducible (unitary) representation
of degree $h$ of the compact subgroup K and $\chi$ the character of $u$. Then
the subspace $R_\chi\mathfrak{L}_2(G)$ is a direct sum of $h$ subspaces each of which transforms
under left translation by elements of G equivalently to the operators of the
representation $U$ = ind $u$ of G.*


Indeed, since K is compact, any continuous irreducible unitary representation
$u$ of K is equivalent to one of the above $u_\nu$, as we have observed above.

Let us now keep the irreducible representation $u$ of K with character $\chi$
fixed. Perform as in §4 the central decomposition (4.2) of the space $\mathfrak{L}_2(G)$
under the operators $R(g)$, $g \in G$. The operator $R_\chi$ defined by the last of
equations (5.6) is a projection which commutes clearly with $L(g)$ for every
$g \in G$, hence also with every element of the ring $L$ which the operators $L(g)$
generate. Hence we may apply Lemma 3.1 and conclude that if $t$ is an
element of a certain measurable set (of positive measure) $T_1$, then the space
$R_\chi(t)\mathfrak{H}_t$ may be identified with the space $\mathfrak{H}_{1t}$ obtained by performing the
central decomposition of the space $R_\chi\mathfrak{L}_2(G)$ under the operators $R_\chi L(g)$
of the representation $hU$ = ind($hu$). Here $T_1$ is the set of those $t$ for which
$R_\chi(t) \neq 0(t)$.

Put


(5.7) $L_\chi = h\int \chi(k)L(k)\,dk$,


and denote by $L_\chi(t)$ the operator-valued function into which the operator $L_\chi$
decomposes under the central decomposition (4.2).


* Note that it follows from this together with Lemma 7.1 of [8] that factors of
type III cannot occur (except on a set of measure zero) in the central decomposition
of $\operatorname{ind}_{K \uparrow G} u$, whenever K is compact and G unimodular and separable.




Lemma 5.2. Denote by $R(t)$ the factor into which the ring $R$ (generated
by the right translations $R(g)$) decomposes for almost every $t$ under the central
decomposition (4.2). Then the restriction of the mapping $X(t) \to x(t)$
defined by (4.4) to those elements $X(t)$ which satisfy $X(t) = R_\chi(t)X(t)R_\chi(t)$
is for almost every $t$ a one-one linear mapping between a dense linear
subspace of $R_\chi(t)R(t)R_\chi(t)$ and a dense linear subspace of the space
$R_\chi(t)L_\chi(t)\mathfrak{H}_t$. Hence in particular


(5.8) $\dim R_\chi(t)R(t)R_\chi(t) = \dim R_\chi(t)L_\chi(t)\mathfrak{H}_t$,


where dim denotes the (ordinary) dimension of a vector space over the field
of complex numbers.


Proof. Replace the measure $\beta$ of Lemma 4.3 by the measure $h\chi(k)\,dk$,
where $dk$ refers, as throughout, to the Haar measure on K. Then the
hypotheses of Lemma 4.3 are readily seen to be verified by the operators $R_\chi$ and
$L_\chi$. Hence if $P(t)$ denotes again the dense subalgebra of $R(t)$ introduced
in Lemma 4.1, then Lemma 4.3 tells us that under the mapping (4.4)
the elements of $P(t)R_\chi(t)$ are mapped into $L_\chi(t)\mathfrak{H}_t$, and the elements of
$R_\chi(t)P(t)$ into $R_\chi(t)\mathfrak{H}_t$. Hence $R_\chi(t)P(t)R_\chi(t)$ is mapped into the
subspace $R_\chi(t)L_\chi(t)\mathfrak{H}_t$ of $\mathfrak{H}_t$ (for almost every $t$). Denote by $\mathfrak{H}_t^0$ the image
of $P(t)$ (in $\mathfrak{H}_t$) under the mapping (4.4). Then $\mathfrak{H}_t^0$ is a dense subspace
of $\mathfrak{H}_t$ and the image of $R_\chi(t)P(t)R_\chi(t)$ under the mapping (4.4) consists,
as we have just seen, of $R_\chi(t)L_\chi(t)\mathfrak{H}_t^0$. Hence clearly $R_\chi(t)P(t)R_\chi(t)$
is a dense subalgebra (in the weak topology for operators) of $R_\chi(t)R(t)R_\chi(t)$,
and $R_\chi(t)L_\chi(t)\mathfrak{H}_t^0$ a dense linear subspace of $R_\chi(t)L_\chi(t)\mathfrak{H}_t$, which proves
Lemma 5.2.


Let us now consider the space $R_\chi(t)\mathfrak{H}_t$. It is an invariant subspace of
$\mathfrak{H}_t$ under the elements of the algebra $L(t)$ as follows from Lemma 13 of [15].
By Theorem 1.1 of [8] there exists a continuous unitary representation $V_t$
of G, $V_t: g \to V_t(g)$, where the operators $V_t(g)$ act in the space $\mathfrak{H}_t$ and generate
the ring $L(t)$ according to Lemma 1.2 of [8]. So we have


$R_\chi(t)V_t(g) = V_t(g)R_\chi(t)$
for all $g$ and all $t$ outside of one set of measure zero which is independent
of $g$. Therefore the operators $R_\chi(t)V_t(g)$ form a representation of G in the
space $R_\chi(t)\mathfrak{H}_t$ for almost every $t$.
We shall now prove that


(5.9) $L_\chi(t) = h\int \chi(k)V_t(k)\,dk$


for almost every $t$. Let $y_n(g)$ be a sequence of elements of $\mathfrak{L}_1(G)$ such that for
the corresponding operators $L_{y_n}$ acting on $\mathfrak{L}_2(G)$ we have strong lim $L_{y_n} = I$.

Denote by $L_{y_n}(t)$ the operator-valued function into which $L_{y_n}$ decomposes
under the central decomposition (4.2). In accordance with p. 442 of [15]
there exists a subsequence of the $y_n$ for which the $L_{y_n}(t)$ converge strongly
to the identity $I(t)$ for almost every $t$. Let us assume that this subsequence
has been chosen and denote it again by $y_n$. Since

$L_{y_n} = \int_G y_n(g)\,L(g)\,dg$

by definition, we have for almost every $t$

$L_{y_n}(t) = \int_G y_n(g)\,U(g,t)\,dg.$

On the other hand it has been shown in § 1 of [8] that

$L_{y_n}(t) = \int_G y_n(g)\,V_t(g)\,dg$

for almost every $t$. Now put $z_n(g) = h \int_K y_n(gk^{-1})\chi(k)\,dk$; then

strong lim $L_{z_n}$ = strong lim $L_{y_n}L_\chi = L_\chi$

since $L_{z_n} = L_{y_n}L_\chi$. Also $L_{z_n}(t) = L_{y_n}(t)L_\chi(t)$ for almost every $t$ implies

strong lim $L_{z_n}(t)$ = strong lim $\int_G z_n(g)\,U(g,t)\,dg$
= strong lim $\int_G z_n(g)\,V_t(g)\,dg = L_\chi(t) = h \int_K \chi(k)\,U(k,t)\,dk.$

But since the operators $V_t(g)$ form a representation of $G$, the definition of
$z_n(g)$ implies

$\int_G z_n(g)\,V_t(g)\,dg = L_{y_n}(t)\, h \int_K \chi(k)\,V_t(k)\,dk.$

Hence strong lim $L_{y_n}(t) = I(t)$ for almost every $t$ implies

strong lim $\int_G z_n(g)\,V_t(g)\,dg = h \int_K \chi(k)\,V_t(k)\,dk$

for almost every $t$. Since this limit was shown above to be $L_\chi(t)$, this proves
equation (5.9) for almost every $t$.

From (5.9) we infer

(5.10) $R_\chi(t)L_\chi(t) = L_\chi(t)R_\chi(t) = h \int_K \chi(k)\,R_\chi(t)V_t(k)\,dk.$

This proves

Lemma 5.3. The subspace $L_\chi(t)R_\chi(t)\mathfrak{H}_t$ of $R_\chi(t)\mathfrak{H}_t$ is for almost every
$t$ the sum of all those subspaces of $R_\chi(t)\mathfrak{H}_t$ which transform equivalently to
the representation $u$ of $K$ under the operators $R_\chi(t)V_t(k)$, for $k \in K$. Here
$R_\chi(t)V_t(k)$ is considered as an operator of the space $R_\chi(t)\mathfrak{H}_t$.

Next we have

Lemma 5.4. The subalgebra $R_\chi(t)\mathcal{R}(t)R_\chi(t)$, considered as an algebra
of operators of the space $R_\chi(t)\mathfrak{H}_t$, is for almost every $t$ exactly the set of those
bounded operators of the space $R_\chi(t)\mathfrak{H}_t$ which commute with (the restriction
of) $R_\chi(t)V_t(g)$ (to the space $R_\chi(t)\mathfrak{H}_t$) for every $g \in G$.


Proof. According to (4.1) the ring $\mathcal{R}$ is the commuting algebra of $\mathcal{L}$
in the space $L_2(G)$. Hence if we apply Lemma 13 of [15] to our above
central decomposition of $L_2(G)$, we may conclude that $\mathcal{R}(t)$ is the commuting
algebra of $\mathcal{L}(t)$ for almost every $t$. By § 1 of [8] we know that after
omission of one $t$-set of measure zero we have $V_t(g) \in \mathcal{L}(t)$ for all $g \in G$,
and that the $V_t(g)$ generate $\mathcal{L}(t)$. But all this together with $R_\chi(t) \in \mathcal{R}(t)$
implies Lemma 5.4.
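
The last step of this proof rests on a standard fact about commuting algebras, which can be sketched as follows. Here $E$ stands for $R_\chi(t)$ and is assumed to be a projection; the symbols $E$, $A$, $X$, $Y$ and $\tilde{Y}$ are introduced only for this sketch and are not notation from the text.

```latex
% Sketch, assuming E = R_\chi(t) is a projection in \mathcal{R}(t) = \mathcal{L}(t)'.
%
% (i) If X \in \mathcal{R}(t) and X = EXE, then XE = X = EX, and for every
%     A \in \mathcal{L}(t) we have EA = AE and XA = AX, so
\[ X\,(EA) = XA = AX = (EA)\,X . \]
%     Thus X commutes with the restriction of EA to E\mathfrak{H}_t.
%
% (ii) Conversely, let Y be a bounded operator on E\mathfrak{H}_t commuting with
%      every restriction of EA to E\mathfrak{H}_t, and put \tilde{Y} = YE on \mathfrak{H}_t. Then
\[ \tilde{Y}A = YEA = Y(AE) = (AE)\,YE = A\tilde{Y}, \]
%      so \tilde{Y} \in \mathcal{L}(t)' = \mathcal{R}(t) and \tilde{Y} = E\tilde{Y}E.
```

Since the $V_t(g)$ generate $\mathcal{L}(t)$, commuting with every $R_\chi(t)V_t(g)$ amounts to commuting with every $EA$, which is how the two inclusions above yield the statement of Lemma 5.4.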


Let us now combine Lemmas 5.1, 5.2, 5.3 and 5.4. By Lemma 5.3
we know that $L_\chi(t)R_\chi(t)\mathfrak{H}_t$ is the sum of all those subspaces of $R_\chi(t)\mathfrak{H}_t$
which transform equivalently to the given representation $u(k)$ of $K$ under
the operators $V_t(k)$ for $k \in K$. Hence we have


(5.11) $\dim [L_\chi(t)R_\chi(t)\mathfrak{H}_t] = h \cdot$ [multiplicity of $u$ in $R_\chi(t)V_t(K)$].

Hence Lemmas 5.2 and 5.4 imply

(5.12) $\dim \{[R_\chi(t)V_t(g)]'\} = h \cdot$ [multiplicity of $u$ in $R_\chi(t)V_t(K)$],


where $[R_\chi(t)V_t(g)]'$ denotes the commuting algebra of the operators
$R_\chi(t)V_t(g)$ in the space $R_\chi(t)\mathfrak{H}_t$.
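
The way the cited results combine into (5.12) can be written as a single chain. This is a reading of the argument above rather than an additional result; $R_\chi(t)$, $L_\chi(t)$ and $\mathfrak{H}_t$ denote the operators and component spaces of Lemma 5.2.

```latex
\begin{align*}
\dim\{[R_\chi(t)V_t(g)]'\}
  &= \dim R_\chi(t)\,\mathcal{R}(t)\,R_\chi(t)
     && \text{(Lemma 5.4)} \\
  &= \dim R_\chi(t)\,L_\chi(t)\,\mathfrak{H}_t
     && \text{(5.8)} \\
  &= \dim L_\chi(t)\,R_\chi(t)\,\mathfrak{H}_t
     && \text{(5.10)} \\
  &= h \cdot [\text{multiplicity of } u \text{ in } R_\chi(t)V_t(K)]
     && \text{(5.11)}.
\end{align*}
```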

Now let us consider the operators $U(g)$ of the induced representation
$U = \operatorname{ind} u$ acting in the space $\mathfrak{H}$ as defined in § 1. The representation $u$ of $K$
being irreducible, we can apply Lemma 5.1 and conclude that the space
$R_\chi L_2(G)$ can be identified with the Kronecker product $\mathfrak{h} \times \mathfrak{H}$ such that $R_\chi L(g)$
becomes identified with $I_h \times U(g)$, where $I_h$ denotes the identity matrix in
the $h$-dimensional space $\mathfrak{h}$. If $\mathfrak{H} = \int \mathfrak{H}_{t'}$ is the central decomposition of $\mathfrak{H}$
under the operators $U(g)$ introduced in § 1 (cf. equation (1.8)), then von
Neumann's result on the essential uniqueness of the central decomposition
(cf. loc. cit.) implies that the spaces obtained from the central decomposition
of $R_\chi L_2(G)$ can be identified with the spaces $\mathfrak{h} \times \mathfrak{H}_{t'}$ in such a manner that
in particular each (measurable) operator-valued function in the one
decomposition goes over into the corresponding operator-valued function in the
other decomposition (neglecting of course sets of measure zero again). On
the other hand we know from Lemma 3.1 that the spaces obtained from the
central decomposition of $R_\chi L_2(G)$ may be identified for $t \in T_\chi$ with the spaces
$R_\chi(t)\mathfrak{H}_t$, where the $\mathfrak{H}_t$ are the component spaces of the central decomposition
of $L_2(G)$ itself, so that corresponding operator-valued functions go again over
into each other. Hence we see that there is a one-one correspondence $t' \leftrightarrow t$
between almost all elements $t'$ which occur in (1.8) and almost all those
elements $t$ which occur in (4.2) and are elements of $T_\chi$. Here $T_\chi$ is the set
of those $t$ for which $R_\chi(t) \neq 0(t)$. Moreover this one-one correspondence
$t' \leftrightarrow t$ is such that for each corresponding pair $t'$, $t$ there exists a unitary
operator $J(t',t)$ mapping $\mathfrak{h} \times \mathfrak{H}_{t'}$ onto $R_\chi(t)\mathfrak{H}_t$ in such a manner that


(5.13) $R_\chi(t)V_t(g) = J(t',t)\,[I_h \times U_{t'}(g)]\,J(t',t)^{-1}.$
Here the operators $U_{t'}(g)$ are the operators of the unitary representation
of $G$ acting in $\mathfrak{H}_{t'}$ as introduced in § 1 (compare equations (1.8) and (1.9)).
From (5.13) we infer at once

(5.14) $\dim \{[R_\chi(t)V_t(g)]'\} = h^2 \dim \{[U_{t'}(g)]'\}$

for the dimensions of the commuting algebras of the operators $R_\chi(t)V_t(g)$
and $U_{t'}(g)$ respectively.
Also (5.13) obviously implies

(5.15) $h \cdot$ [multiplicity of $u$ in $U_{t'}(K)$]
= multiplicity of $u$ in $R_\chi(t)V_t(K)$.


Hence combining (5.12) and (5.15) we get


(5.16) $\dim \{[R_\chi(t)V_t(g)]'\} = h^2 \cdot$ [multiplicity of $u$ in $U_{t'}(K)$]


for each corresponding pair $t'$, $t$. Hence (5.14) and (5.16) together imply
for almost every $t'$

(5.17) multiplicity of $u$ in $U_{t'}(K)$ = $\dim \{[U_{t'}(g)]'\}$.


Since equation (5.17) is exactly the assertion of Theorem 1, i.e. equations
(5.17) and (1.10) are identical, the proof of Theorem 1 is herewith
completed.
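
The bookkeeping of the factors of $h$ in the last step can be laid out as below, followed by a finite-dimensional check. The check is an assumption made for illustration only and is not part of the text: it supposes all dimensions are finite and that $U_{t'}$ consists of $m$ copies of one irreducible representation $W$ of $G$ in which $u$ occurs $n$ times on restriction to $K$. The symbols $M_h$, $W$, $m$ and $n$ are hypothetical names for this sketch.

```latex
% The commuting algebra of I_h \times U_{t'}(g) is M_h \otimes [U_{t'}(g)]',
% where M_h denotes all h-by-h matrices; this accounts for the factor h^2.
\begin{align*}
h^2 \dim\{[U_{t'}(g)]'\}
  &= \dim\{[R_\chi(t)V_t(g)]'\}
     && \text{by (5.13)} \\
  &= h \cdot [\text{multiplicity of } u \text{ in } R_\chi(t)V_t(K)]
     && \text{by (5.12)} \\
  &= h^2 \cdot [\text{multiplicity of } u \text{ in } U_{t'}(K)]
     && \text{by (5.13) again}.
\end{align*}
% Assumed finite-dimensional check: U_{t'} = m copies of an irreducible W,
% with u occurring n times in the restriction of W to K. Then
\[ [\text{multiplicity of } u \text{ in } U_{t'}(K)] = mn, \qquad
   \dim\{[U_{t'}(g)]'\} = m^2 , \]
% so (5.17) reads mn = m^2, i.e. n = m whenever m \neq 0.
```

In the finite-group case this is the familiar Frobenius reciprocity statement: the multiplicity of $W$ in $\operatorname{ind} u$ equals the multiplicity of $u$ in the restriction of $W$ to $K$.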


Remark. In the course of this proof we have obtained somewhat more
information than Theorem 1 asserts. For instance if we combine Lemmas 5.2,
5.3 and 5.4 we see that there is a natural linear one-one mapping between a
dense linear subspace of the sum of those subspaces of $R_\chi(t)\mathfrak{H}_t$ which
transform equivalently to $u(k)$ under the subgroup $K$ and a dense linear subspace
of the commuting algebra of the operators $R_\chi(t)V_t(g)$ in the space $R_\chi(t)\mathfrak{H}_t$.


THE JOHNS HOPKINS UNIVERSITY. 




BIBLIOGRAPHY. 


[1] E. Cartan, "Sur la détermination d'un système orthogonal complet dans un espace
de Riemann symétrique clos," Rendiconti del Circolo Matematico di
Palermo, vol. 53 (1929), pp. 217-252.

[2] E. Cartan, "Les espaces riemanniens symétriques," Verhandlungen des
Internationalen Mathematiker-Kongresses, Zürich, 1932, vol. 1, pp. 152-161.

[3] E. Cartan, "Sur les domaines bornés homogènes de l'espace de n variables
complexes," Abhandlungen aus dem Mathematischen Seminar der Hansischen
Universität, vol. 11 (1936), pp. 111-162.

[4] I. M. Gelfand, "Spherical functions in symmetric Riemann spaces," Doklady
Akademii Nauk SSSR, vol. 70 (1950), pp. 5-8.

[5] G. W. Mackey, "Imprimitivity for representations of locally compact groups. I,"
Proceedings of the National Academy of Sciences, vol. 35 (1949), pp.
537-545.

[6] G. W. Mackey, "Imprimitivité pour les représentations des groupes localement
compacts II," Comptes Rendus, vol. 230 (1950), pp. 808-809; III, vol. 230 (1950),
pp. 908-909.

[7] F. I. Mautner, "Unitary representations of locally compact groups I," Annals of
Mathematics, vol. 51 (1950), pp. 1-25.

[8] F. I. Mautner, "Unitary representations of locally compact groups II," ibid., vol. 52
(1950), pp. 528-556.

[9] F. I. Mautner, "The structure of the regular representation of certain discrete groups,"
Duke Mathematical Journal, vol. 17 (1950), pp. 437-441.

[10] F. I. Mautner, "Infinite dimensional irreducible representations of certain groups,"
Proceedings of the American Mathematical Society, vol. 1 (1950), pp.
582-584.

[11] F. I. Mautner, "On the decomposition of unitary representations of Lie groups,"
Proceedings of the American Mathematical Society, vol. 2 (1951), pp. 490-496.

[11a] F. I. Mautner, "Fourier analysis and symmetric spaces," Proceedings of the National
Academy of Sciences, vol. 37 (1951), pp. 529-533.

[12] F. J. Murray and J. von Neumann, "Rings of operators," Annals of Mathematics,
vol. 37 (1936), pp. 116-229.

[13] J. von Neumann, "On rings of operators III," ibid., vol. 41 (1940), pp. 94-161.

[14] F. J. Murray and J. von Neumann, "On rings of operators IV," Annals of
Mathematics, vol. 44 (1943), pp. 716-808.

[15] J. von Neumann, "On rings of operators. Reduction Theory," ibid., vol. 50 (1949),
pp. 401-485.

[16] J. von Neumann, "Zur Algebra der Funktionaloperationen und Theorie der Normalen
Operatoren," Mathematische Annalen, vol. 102 (1929), pp. 370-427.

[17] C. E. Rickart, "Banach algebras with an adjoint operation," Annals of
Mathematics, vol. 47 (1946), pp. 528-549.

[18] C. L. Siegel, "Symplectic Geometry," American Journal of Mathematics, vol. 65
(1943), pp. 1-86.

[19] H. Weyl, "Harmonics on homogeneous manifolds," Annals of Mathematics, vol. 35
(1934), pp. 486-499.

[20] A. Wintner, "On the location of continuous spectra," American Journal of
Mathematics, vol. 70 (1948), pp. 22-30.




